NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS

FIELD OF THE INVENTION 

The present invention relates to novel polynucleotides of human origin 
and the encoded gene products. 

BACKGROUND OF THE INVENTION

Identification of novel polynucleotides, particularly those that encode an 
expressed gene product, is important in the advancement of drug discovery, diagnostic 
technologies, and the understanding of the progression and nature of complex diseases 
such as cancer. Identification of genes expressed in different cell types isolated from 
sources that differ in disease state or stage, developmental stage, exposure to various
environmental factors, the tissue of origin, the species from which the tissue was 
isolated, and the like is key to identifying the genetic factors that are responsible for the 
phenotypes associated with these various differences. 

This invention provides novel human polynucleotides, the polypeptides 
encoded by these polynucleotides, and the genes and proteins corresponding to these
novel polynucleotides. 

SUMMARY OF THE INVENTION 

This invention relates to novel human polynucleotides and variants
thereof, their encoded polypeptides and variants thereof, to genes corresponding to these
polynucleotides and to proteins expressed by the genes. The invention also relates to
diagnostics and therapeutics comprising such novel human polynucleotides, their
corresponding genes or gene products, including probes, antisense nucleotides, and
antibodies. The polynucleotides of the invention correspond to a polynucleotide
comprising the sequence information of at least one of SEQ ID NOs:1-3351.

Various aspects and embodiments of the invention will be readily
apparent to the ordinarily skilled artisan upon reading the description provided herein.

DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to polynucleotides comprising the disclosed
nucleotide sequences, to full length cDNA, mRNA, genomic sequences, and genes
corresponding to these sequences and degenerate variants thereof, and to polypeptides
encoded by the polynucleotides of the invention and polypeptide variants.

Polypeptide variants differ from wild type protein in having one or more 
amino acid substitutions that either enhance, add, or diminish a biological activity of the 
wild type protein.

Six of the polynucleotides disclosed herein encode new members of the MKK
kinase family; the coding region is found within the nucleotide region in parentheses: SEQ
ID NO:29 (nucleotides 295-421); SEQ ID NO:31 (298-397); SEQ ID NO:196 (37-322);
SEQ ID NO:3175 (nucleotides 14-164); SEQ ID NO:3190 (229-390); and SEQ ID
NO:3281 (15-182). Twenty-four of the polynucleotides encode new members of the family
of transcription factor proteins having a basic region plus leucine zipper: SEQ ID NO:410
(42-191); SEQ ID NO:552 (116-288); SEQ ID NO:768 (116-288); SEQ ID NO:822 (108-262);
SEQ ID NO:836 (158-353); SEQ ID NO:1288 (73-234); SEQ ID NO:1365 (69-257);
SEQ ID NO:1540 (289-471); SEQ ID NO:1549 (200-391); SEQ ID NO:1556 (163-354);
SEQ ID NO:1557 (207-398); SEQ ID NO:1563 (107-298); SEQ ID NO:1622 (180-365);
SEQ ID NO:1630 (100-291); SEQ ID NO:1704 (184-372); SEQ ID NO:1808 (36-161);
SEQ ID NO:1454 (49-209); SEQ ID NO:2363 (48-211); SEQ ID NO:2424 (43-194);
SEQ ID NO:3147 (190-369); SEQ ID NO:3152 (129-320); SEQ ID NO:3158 (167-334);
and SEQ ID NO:3208 (34-256).

SEQ ID NOs:186 (175-395); 2591 (60-165); 3307 (43-321); and 3339
(94-342) encode polypeptides having an SH2 domain, and SEQ ID NOs:234 (23-121),
1832 (18-173), and 1835 (57-206) encode polypeptides having an SH3 domain. Nine
polynucleotides encode new members of the family of proteins having Ank repeat regions:
SEQ ID NO:187 (358-432); SEQ ID NO:1268 (238-315); SEQ ID NO:1804 (301-378);
SEQ ID NO:1819 (278-355); SEQ ID NO:1839 (224-307); SEQ ID NO:1830 (184-267);
SEQ ID NO:2562 (18-101); SEQ ID NO:3015 (131-214); and SEQ ID NO:3267 (97-180).

The following eleven polynucleotides encode polypeptides having a C2H2
type zinc finger: SEQ ID NOs:308 (110-172); 807 (339-392); 1324 (294-356); 1503 (154-216);
1527 (156-212); 1674 (196-258); 1779 (64-126); 1801 (295-351); 3081 (190-252);
3193 (293-355); and 3306 (161-223). Eight polynucleotides encode polypeptides of the
family of ATPases: SEQ ID NOs:431 (71-428); 639 (157-561); 2135 (2-401); 2684 (9-461);
2859 (100-320); 3178 (45-386); 3197 (281-343); and 3266 (8-139). Polypeptides
having a fibronectin type III domain are encoded by SEQ ID NOs:746 (209-427) and 1192
(186-416). Polypeptides having an EF-hand domain are encoded by SEQ ID NOs:820 (341-406);
1755 (281-367); and 3285 (16-102). Six polypeptides of the protein kinase family are
encoded by SEQ ID NOs:1157 (41-444); 1478 (54-437); 1496 (241-520); 2286 (12-182);
2969 (5-387); and 3190 (118-390).

LIM domain-containing polypeptides are encoded by SEQ ID NOs:1269
(79-240); 1309 (248-404); 1360 (222-377); and 1386 (243-398). Two polypeptides of the
family having a C2 domain (protein kinase C-like) are encoded by SEQ ID NOs:1325 (1-234)
and 2282 (183-353). Polypeptides having a WD domain, G-beta repeat motif are
encoded by SEQ ID NOs:1336 (66-164); 1380 (42-140); 1711 (263-361); 1762 (236-334);
1909 (160-258); 2218 (127-225); 3047 (191-292); 3108 (275-367); and 3292 (208-300).

SEQ ID NO:1410 (222-350) encodes a member of the trypsin family. SEQ
ID NOs:1417 (8-354); 2281 (20-387); and 2310 (20-371) encode members of the protein
tyrosine phosphatase family. SEQ ID NOs:1464 (4-180) and 1514 (2-252) encode
members of the family having an RNA recognition motif (also known as RRM, RBD, or
RNP domain). SEQ ID NOs:1496 (241-520) and 3297 (7-153) encode helicases having a
conserved C-terminal domain. SEQ ID NO:1538 (9-635) encodes a member of the wnt
family of developmental signaling proteins.

Three polynucleotides encode polypeptides having a homeobox domain:
SEQ ID NOs:1676 (9-86); 1820 (123-299); and 1821 (127-303). A novel thioredoxin is
encoded by SEQ ID NO:1677 (316-369). Two novel members of the ras family are
encoded by SEQ ID NOs:1688 (109-410) and 3258 (138-394). A novel polypeptide having a
phosphatidylinositol-specific phospholipase C Y-domain is encoded by SEQ ID NO:1707
(92-439). A novel serine carboxypeptidase is encoded by SEQ ID NO:1744 (238-433). A
novel polypeptide having N-terminal homology in the Ets domain is encoded by SEQ ID
NO:1811 (184-315). A novel polypeptide having a bromodomain is encoded by SEQ ID
NO:1814 (127-294). A novel polypeptide having a double-stranded RNA binding motif is
encoded by SEQ ID NO:1818 (9-146). A novel polypeptide having a G-protein alpha
subunit is encoded by SEQ ID NO:1846 (12-398).

SEQ ID NOs:1911 (35-151) and 1980 (60-197) encode polypeptides
having a C3HC4 type zinc finger domain (RING finger). SEQ ID NO:2065 (253-306)
encodes a polypeptide having a CCHC zinc finger domain. SEQ ID NO:2216 (90-179)
encodes a polypeptide having a WW/rsp5/WWP domain. SEQ ID NO:2428 (25-350)
encodes a polypeptide member of the dual specificity phosphatase family, having a
catalytic domain.

SEQ ID NOs:2577 (0-311); 3183 (14-215); and 3195 (0-215) encode
members of the 4 transmembrane segment integral membrane protein family. SEQ ID
NOs:2826 (116-400) and 2871 (198-392) encode polypeptides of the DEAD and DEAH
box helicase family. SEQ ID NO:2944 (18-281) encodes a polypeptide having a
calpain large subunit, domain III.

SEQ ID NO:3274 (11-187) encodes a eukaryotic transcription factor 
with a fork head domain. SEQ ID NO:3345 (65-271) encodes a polypeptide having a
PDZ domain, and SEQ ID NO:3351 (124-270) encodes a polypeptide in the family of 
phorbol esters/glycerol binding proteins. 

Described below are polynucleotide compositions encompassed by the
invention, methods for obtaining cDNA or genomic DNA encoding a full-length gene
product, expression of these polynucleotides and genes, identification of structural motifs
of the polynucleotides and genes, identification of the function of a gene product encoded
by a gene corresponding to a polynucleotide of the invention, use of the provided
polynucleotides as probes and in mapping and in tissue profiling, use of the corresponding
polypeptides and other gene products to raise antibodies, and use of the polynucleotides
and their encoded gene products for therapeutic and diagnostic purposes.

Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions
includes, but is not necessarily limited to, polynucleotides having a sequence set forth in
any one of SEQ ID NOs:1-3351; polynucleotides obtained from the biological materials
described herein or other biological sources (particularly human sources) by
hybridization under stringent conditions (particularly conditions of high stringency);
genes corresponding to the provided polynucleotides; variants of the provided
polynucleotides and their corresponding genes, particularly those variants that retain a
biological activity of the encoded gene product (e.g., a biological activity ascribed to a
gene product corresponding to the provided polynucleotides as a result of the
assignment of the gene product to a protein family(ies) and/or identification of a
functional domain present in the gene product). Other nucleic acid compositions
contemplated by and within the scope of the present invention will be readily apparent
to one of ordinary skill in the art when provided with the disclosure here.

"Polynucleotide" and "nucleic acid" as used herein with reference to nucleic acids of
the composition are not intended to be limiting as to the length or structure of the nucleic
acid unless specifically indicated.

The invention features polynucleotides that are expressed in human
tissue, specifically human colon, breast, and/or lung tissue. Novel nucleic acid
compositions of the invention comprise a sequence set forth in any one of SEQ ID
NOs:1-3351 or an identifying sequence thereof. An "identifying sequence" is a
contiguous sequence of residues at least about 10 nt to about 20 nt in length, usually at
least about 50 nt to about 100 nt in length, that uniquely identifies a polynucleotide
sequence, e.g., exhibits less than 90%, usually less than about 80% to about 85%
sequence identity to any contiguous nucleotide sequence of more than about 20 nt.
Thus, the subject novel nucleic acid compositions include full length cDNAs or mRNAs
that encompass an identifying sequence of contiguous nucleotides from any one of SEQ
ID NOs:1-3351.
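
By way of illustration only, the uniqueness criterion for an identifying sequence can be sketched in Python. The sketch assumes an ungapped, window-by-window comparison against a caller-supplied set of other sequences; the function names and the 90% threshold argument are hypothetical choices made for this example, and a practical screen would use a database search tool rather than this exhaustive scan.

```python
def max_window_identity(candidate, other):
    """Highest ungapped fractional identity between `candidate` and any
    same-length window of `other` (0.0 if `other` is shorter)."""
    k = len(candidate)
    best = 0.0
    for start in range(len(other) - k + 1):
        window = other[start:start + k]
        matches = sum(1 for x, y in zip(candidate, window) if x == y)
        best = max(best, matches / k)
    return best


def is_identifying(candidate, background, threshold=0.90):
    """True if the candidate shows less than `threshold` identity to every
    window of every background sequence (assumption: ungapped comparison)."""
    return all(max_window_identity(candidate, seq) < threshold
               for seq in background)


candidate = "ACGTTGCAAGGCTTAACCGT"          # 20 nt, invented for the example
unrelated = "T" * 30
near_copy = "GG" + "ACGTTGCAAGGCTTAACCGA" + "AA"   # one mismatch in 20 nt
print(is_identifying(candidate, [unrelated]))   # no window reaches 90%
print(is_identifying(candidate, [near_copy]))   # a 95% window exists
```
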

The polynucleotides of the invention also include polynucleotides having
sequence similarity or sequence identity. Nucleic acids having sequence similarity are
detected by hybridization under low stringency conditions, for example, at 50°C and
10XSSC (0.9 M saline/0.09 M sodium citrate) and remain bound when subjected to
washing at 55°C in 1XSSC. Sequence identity can be determined by hybridization
under stringent conditions, for example, at 50°C or higher and 0.1XSSC (9 mM
saline/0.9 mM sodium citrate). Hybridization methods and conditions are well known
in the art, see, e.g., U.S. Patent No. 5,707,829. Nucleic acids that are substantially
identical to the provided polynucleotide sequences, e.g., allelic variants, genetically
altered versions of the gene, etc., bind to the provided polynucleotide sequences (SEQ
ID NOs:1-3351) under stringent hybridization conditions. By using probes, particularly
labeled probes of DNA sequences, one can isolate homologous or related genes. The
source of homologous genes can be any species, e.g., primate species, particularly
human; rodents, such as rats and mice; canines, felines, bovines, ovines, equines, yeast,
nematodes, etc.

Preferably, hybridization is performed using at least 15 contiguous
nucleotides (nt) of at least one of SEQ ID NOs:1-3351. That is, when at least 15
contiguous nt of one of the disclosed SEQ ID NOs. is used as a probe, the probe will
preferentially hybridize with a nucleic acid comprising the complementary sequence,
allowing the identification and retrieval of the nucleic acids that uniquely hybridize to
the selected probe. Probes from more than one SEQ ID NO. can hybridize with the
same nucleic acid if the cDNA from which they were derived corresponds to one
mRNA. Probes of more than 15 nt can be used, e.g., probes of from about 18 nt to
about 100 nt, but 15 nt represents sufficient sequence for unique identification.
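
The relationship between a probe and the complementary sequence it hybridizes to can be illustrated with a short Python sketch. It models only perfect Watson-Crick pairing on an idealized string; it is not a model of hybridization kinetics or stringency. The function names and the example probe are hypothetical, and the 15 nt minimum is taken from the preceding paragraph.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_sites(probe, target, min_len=15):
    """Start positions in `target` (5'->3') of the exact complement of `probe`.

    Assumption: perfect pairing only; mismatch-tolerant binding is ignored.
    """
    if len(probe) < min_len:
        raise ValueError("probe shorter than %d nt" % min_len)
    site = reverse_complement(probe)
    target = target.upper()
    positions = []
    start = target.find(site)
    while start != -1:
        positions.append(start)
        start = target.find(site, start + 1)
    return positions


probe = "ATGGCCAAGCTTGAC"                       # 15 nt, invented for the example
target = "CC" + reverse_complement(probe) + "GG"
print(probe_sites(probe, target))
```
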

The polynucleotides of the invention also include naturally occurring
variants of the nucleotide sequences (e.g., degenerate variants, allelic variants).
Variants of the polynucleotides of the invention are identified by hybridization of
putative variants with nucleotide sequences disclosed herein, preferably by
hybridization under stringent conditions. For example, by using appropriate wash
conditions, variants of the polynucleotides of the invention can be identified where the
allelic variant exhibits at most about 25-30% base pair (bp) mismatches relative to the
selected polynucleotide probe. In general, allelic variants contain 15-25% bp
mismatches, and can contain as few as 5-15%, or 2-5%, or 1-2% bp mismatches,
or even a single bp mismatch.
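
For illustration, the percentage of base pair mismatches between a probe and a putative variant can be computed as in the following Python sketch. It assumes the two sequences are already aligned position by position and are of equal length; the function names and the 30% default ceiling (the upper end of the "about 25-30%" range above) are choices made for this example only.

```python
def percent_mismatch(probe, candidate):
    """Percentage of positions at which two pre-aligned, equal-length
    sequences differ."""
    if len(probe) != len(candidate):
        raise ValueError("sequences must be aligned to equal length")
    mismatches = sum(1 for p, c in zip(probe.upper(), candidate.upper())
                     if p != c)
    return 100.0 * mismatches / len(probe)


def within_variant_range(probe, candidate, max_percent=30.0):
    """True if the mismatch percentage does not exceed `max_percent`."""
    return percent_mismatch(probe, candidate) <= max_percent


probe = "ACGTTGCAAGGCTTAACCGT"       # invented 20 nt example
variant = "ACGTTGCAAGGCTTAACCGA"     # single mismatch at the last position
print(percent_mismatch(probe, variant))
```
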

The invention also encompasses homologs corresponding to the
polynucleotides of SEQ ID NOs:1-3351, where the source of homologous genes can be
any mammalian species, e.g., primate species, particularly human; rodents, such as rats;
canines, felines, bovines, ovines, equines, yeast, nematodes, etc. Between mammalian
species, e.g., human and mouse, homologs generally have substantial sequence
similarity, e.g., at least 75% sequence identity, usually at least 90%, more usually at
least 95% between nucleotide sequences. Sequence similarity is calculated based on a
reference sequence, which may be a subset of a larger sequence, such as a conserved
motif, coding region, flanking region, etc. A reference sequence will usually be at least
about 18 contiguous nt long, more usually at least about 30 nt long, and may extend to
the complete sequence that is being compared. Algorithms for sequence analysis are
known in the art, such as BLAST, described in Altschul et al., J. Mol. Biol. (1990)
215:403-10.

In general, variants of the invention have a sequence identity greater than
about 65%, preferably at least about 75%, more preferably at least about 85%,
and can be greater than at least about 90%, 91%, 92%, 93%, 94%, 95%, or 96%, most
preferably 97%, 98% or 99%. For the purposes of this invention, a preferred method of
calculating percent identity is the Smith-Waterman algorithm, applied as follows.
Global DNA sequence identity must be greater than 65% as determined by the
Smith-Waterman homology search algorithm as implemented in MPSRCH program (Oxford
Molecular) using an affine gap search with the following search parameters: gap open
penalty, 12; and gap extension penalty, 1.
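
A minimal Python sketch of a Smith-Waterman local alignment with affine gaps is given below to illustrate the kind of calculation referred to above. It is not the MPSRCH implementation. The gap open penalty of 12 and gap extension penalty of 1 come from the text; the match and mismatch scores (+5 and -4), the convention that the first gapped position costs the open penalty, and the reporting of identity over the local alignment rather than the full sequences are all assumptions of this example. Identity counts are carried through the dynamic programming table so that no separate traceback is needed.

```python
NEG_INF = float("-inf")


def _extend(state, score_delta, identical):
    """Add one alignment column to a (score, identical, columns) state."""
    score, matches, columns = state
    return (score + score_delta, matches + identical, columns + 1)


def local_identity(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Return (score, identical_columns, aligned_columns) for a best-scoring
    local alignment of a and b under affine gap penalties."""
    zero = (0, 0, 0)
    closed = (NEG_INF, 0, 0)
    h_prev = [zero] * (len(b) + 1)      # best alignment ending at (i-1, j)
    f_prev = [closed] * (len(b) + 1)    # ... ending with a gap in b
    best = zero
    for i in range(1, len(a) + 1):
        h_cur = [zero] * (len(b) + 1)
        f_cur = [closed] * (len(b) + 1)
        e = closed                      # ... ending with a gap in a
        for j in range(1, len(b) + 1):
            e = max(_extend(h_cur[j - 1], -gap_open, 0),
                    _extend(e, -gap_extend, 0))
            f_cur[j] = max(_extend(h_prev[j], -gap_open, 0),
                           _extend(f_prev[j], -gap_extend, 0))
            same = a[i - 1] == b[j - 1]
            diag = _extend(h_prev[j - 1],
                           match if same else mismatch,
                           1 if same else 0)
            cand = max(diag, e, f_cur[j])
            # Local alignment: restart whenever the score is not positive.
            h_cur[j] = cand if cand[0] > 0 else zero
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def percent_identity(a, b, **scoring):
    """Percent identical columns within the best local alignment."""
    _, matches, columns = local_identity(a, b, **scoring)
    return 100.0 * matches / columns if columns else 0.0


print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))
```

Under these assumptions a candidate variant would be compared with the 65% threshold by testing `percent_identity(reference, candidate) > 65.0`.
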

The subject nucleic acids can be cDNAs or genomic DNAs, as well as
fragments thereof, particularly fragments that encode a biologically active gene product
and/or are useful in the methods disclosed herein (e.g., in diagnosis, as a unique
identifier of a differentially expressed gene of interest, etc.). The term "cDNA" as used
herein is intended to include all nucleic acids that share the arrangement of sequence
elements found in native mature mRNA species, where sequence elements are exons
and 3' and 5' non-coding regions. Normally mRNA species have contiguous exons,
with the intervening introns, when present, being removed by nuclear RNA splicing, to
create a continuous open reading frame encoding a polypeptide of the invention.

A genomic sequence of interest comprises the nucleic acid present
between the initiation codon and the stop codon, as defined in the listed sequences,
including all of the introns that are normally present in a native chromosome. It can
further include the 3' and 5' untranslated regions found in the mature mRNA. It can
further include specific transcriptional and translational regulatory sequences, such as
promoters, enhancers, etc., including about 1 kb, but possibly more, of flanking
genomic DNA at either the 5' or 3' end of the transcribed region. The genomic DNA
can be isolated as a fragment of 100 kbp or smaller, substantially free of flanking
chromosomal sequence. The genomic DNA flanking the coding region, either 3' or
5', or internal regulatory sequences as sometimes found in introns, contains sequences
required for proper tissue, stage-specific, or disease-state specific expression.

The nucleic acid compositions of the subject invention can encode all or
a part of the subject polypeptides. Double or single stranded fragments can be obtained
from the DNA sequence by chemically synthesizing oligonucleotides in accordance
with conventional methods, by restriction enzyme digestion, by PCR amplification, etc.
Isolated polynucleotides and polynucleotide fragments of the invention comprise at
least about 10, about 15, about 20, about 35, about 50, about 100, about 150 to about
200, about 250 to about 300, or about 350 contiguous nt selected from the
polynucleotide sequences as shown in SEQ ID NOs:1-3351. The fragments also
include those of lengths intermediate to the specifically mentioned lengths, such as 35,
36, 37, 38, 39, etc.; 150, 151, 152, 153, 154, etc. For the most part, fragments will be of
at least 15 nt, usually at least 18 nt or 25 nt, and up to at least about 50 contiguous nt in
length or more. In a preferred embodiment, the polynucleotide molecules comprise a
contiguous sequence of at least 12 nt selected from the group consisting of the
polynucleotides shown in SEQ ID NOs:1-3351.

Probes specific to the polynucleotides of the invention can be generated
using the polynucleotide sequences disclosed in SEQ ID NOs:1-3351. The probes are
preferably at least about a 12, 15, 16, 18, 20, 22, 24, or 25 nt fragment of a
corresponding contiguous sequence of SEQ ID NOs:1-3351, and can be less than 2, 1,
0.5, 0.1, or 0.05 kb in length. The probes can be synthesized chemically or can be
generated from longer polynucleotides using restriction enzymes. The probes can be
labeled, for example, with a radioactive, biotinylated, or fluorescent tag. Preferably,
probes are designed based upon an identifying sequence of a polynucleotide of one of
SEQ ID NOs:1-3351. More preferably, probes are designed based on a contiguous
sequence of one of the subject polynucleotides that remains unmasked following
application of a masking program for masking low complexity (e.g., XBLAST) to the
sequence, i.e., one would select an unmasked region, as indicated by the
polynucleotides outside the poly-n stretches of the masked sequence produced by the
masking program.
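
The selection of unmasked regions can be illustrated with the following Python sketch. It assumes the masking program has already been run and has replaced low complexity stretches with runs of "N"; the masking step itself is not reproduced. The function name, the example sequence, and the 15 nt minimum length are hypothetical choices for this example.

```python
import re


def unmasked_regions(masked, min_len=15):
    """Return (start, end) half-open coordinates of every run of unmasked
    bases (A, C, G, T) at least `min_len` long in a masked sequence."""
    pattern = re.compile(r"[ACGT]{%d,}" % min_len)
    return [(m.start(), m.end()) for m in pattern.finditer(masked.upper())]


# Invented example: two usable regions separated by masked (poly-n) stretches
# and one unmasked island that is too short to serve as a probe.
masked = ("ACGTACGTACGTACGTAC" + "NNNNNNNN" + "ACGT" + "NNNN"
          + "TTGACCGGTAACCGGTTA")
for start, end in unmasked_regions(masked):
    print(start, end, masked[start:end])
```
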

The polynucleotides of the subject invention are isolated and obtained in
substantial purity, generally as other than an intact chromosome. Usually, the
polynucleotides, either as DNA or RNA, will be obtained substantially free of other
naturally-occurring nucleic acid sequences, generally being at least about 50%, usually
at least about 90% pure, and are typically "recombinant", e.g., flanked by one or more
nucleotides with which they are not normally associated on a naturally occurring
chromosome.

The polynucleotides of the invention can be provided as a linear
molecule or within a circular molecule, and can be provided within autonomously
replicating molecules (vectors) or within molecules without replication sequences.
Expression of the polynucleotides can be regulated by their own or by other regulatory
sequences known in the art. The polynucleotides of the invention can be introduced
into suitable host cells using a variety of techniques available in the art, such as
transferrin polycation-mediated DNA transfer, transfection with naked or encapsulated
nucleic acids, liposome-mediated DNA transfer, intracellular transportation of
DNA-coated latex beads, protoplast fusion, viral infection, electroporation, gene gun,
calcium phosphate-mediated transfection, and the like.

The subject nucleic acid compositions can be used to, for example,
produce polypeptides, as probes for the detection of mRNA of the invention in
biological samples (e.g., extracts of human cells), to generate additional copies of the
polynucleotides, to generate ribozymes or antisense oligonucleotides, and as single
stranded DNA probes or as triple-strand forming oligonucleotides. The probes
described herein can be used to, for example, determine the presence or absence of the
polynucleotide sequences as shown in SEQ ID NOs:1-3351 or variants thereof in a
sample. These and other uses are described in more detail below.

Use of Polynucleotides to Obtain Full-Length cDNA, Gene, and Promoter Region

Full-length cDNA molecules comprising the disclosed polynucleotides
are obtained as follows. A polynucleotide having a sequence of one of SEQ ID NOs:1-3351,
or a portion thereof comprising at least 12, 15, 18, or 20 nt, is used as a
hybridization probe to detect hybridizing members of a cDNA library using probe
design methods, cloning methods, and clone selection techniques such as those
described in U.S. Patent No. 5,654,173. Libraries of cDNA are made from selected
tissues, such as normal or tumor tissue, or from tissues of a mammal treated with, for
example, a pharmaceutical agent. Preferably, the tissue is the same as the tissue from
which the polynucleotides of the invention were isolated, as both the polynucleotides
described herein and the cDNA represent expressed genes. Most preferably, the cDNA
library is made from the biological material described herein in the Examples. The
choice of cell type for library construction can be made after the identity of the protein
encoded by the gene corresponding to the polynucleotide of the invention is known.
This will indicate which tissue and cell types are likely to express the related gene, and
thus represent a suitable source for the mRNA for generating the cDNA. As described
in the Examples, cDNA of the invention was isolated from specific cell or tissue types,
and such cells and tissues are preferable for obtaining related nucleic acids.

Techniques for producing and probing nucleic acid sequence libraries are
described, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual,
2nd Ed. (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be
prepared by using primers based on sequence from SEQ ID NOs:1-3351. In one
embodiment, the cDNA library can be made from only poly-adenylated mRNA. Thus,
poly-T primers can be used to prepare cDNA from the mRNA.

Members of the library that are larger than the provided polynucleotides,
and preferably that encompass the complete coding sequence of the native message, are
obtained. In order to confirm that the entire cDNA has been obtained, RNA protection
experiments are performed as follows. Hybridization of a full-length cDNA to an
mRNA will protect the RNA from RNase degradation. If the cDNA is not full length,
then the portions of the mRNA that are not hybridized will be subject to RNase
degradation. This is assayed, as is known in the art, by changes in electrophoretic
mobility on polyacrylamide gels, or by detection of released monoribonucleotides.
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed. (1989) Cold
Spring Harbor Press, Cold Spring Harbor, NY. In order to obtain additional sequences
5' to the end of a partial cDNA, 5' RACE (PCR Protocols: A Guide to Methods and
Applications, (1990) Academic Press, Inc.) can be performed.

Genomic DNA is isolated using the provided polynucleotides in a
manner similar to the isolation of full-length cDNAs. Briefly, the provided
polynucleotides, or portions thereof, are used as probes to libraries of genomic DNA.
Preferably, the library is obtained from the cell type that was used to generate the
polynucleotides of the invention, but this is not essential. Most preferably, the genomic
DNA is obtained from the biological material described herein in the Examples. Such
libraries can be in vectors suitable for carrying large segments of a genome, such as P1
or YAC, as described in detail in Sambrook et al., 9.4-9.30. In addition, genomic
sequences can be isolated from human BAC libraries, which are commercially available
from Research Genetics, Inc., Huntsville, Alabama, USA, for example. In order to
obtain additional 5' or 3' sequences, chromosome walking is performed, as described in
Sambrook et al., such that adjacent and overlapping fragments of genomic DNA are
isolated. These are mapped and pieced together, as is known in the art, using restriction
digestion enzymes and DNA ligase.

Using the polynucleotide sequences of the invention, corresponding
full-length genes can be isolated using both classical and PCR methods to construct and
probe cDNA libraries. Using either method, Northern blots, preferably, are performed
on a number of cell types to determine which cell lines express the gene of interest at
the highest level. Classical methods of constructing cDNA libraries are taught in
Sambrook et al., supra. With these methods, cDNA can be produced from mRNA and
inserted into viral or expression vectors. Typically, libraries of mRNA comprising
poly(A) tails can be produced with poly(T) primers. Similarly, cDNA libraries can be
produced using the instant sequences as primers.

PCR methods are used to amplify the members of a cDNA library that
comprise the desired insert. In this case, the desired insert will contain sequence from
the full length cDNA that corresponds to the instant polynucleotides. Such PCR
methods include gene trapping and RACE methods as described in Gruber et al., WO
95/04745 and Gruber et al., U.S. Patent No. 5,500,356. Kits are commercially available
to perform gene trapping experiments from, for example, Life Technologies,
Gaithersburg, Maryland, USA. In preferred embodiments of RACE, a common primer
is designed to anneal to an arbitrary adaptor sequence ligated to cDNA ends (Apte and
Siebert, Biotechniques (1993) 15:890-893; Edwards et al., Nuc. Acids Res. (1991)
19:5227-5232). When a single gene-specific RACE primer is paired with the common
primer, preferential amplification of sequences between the single gene specific primer
and the common primer occurs. Commercial cDNA pools modified for use in RACE
are available.

The promoter region of a gene generally is located 5' to the initiation site
for RNA polymerase II. Hundreds of promoter regions contain the "TATA" box, a
sequence such as TATTA or TATAA, which is sensitive to mutations. The promoter
region can be obtained by performing 5' RACE using a primer from the coding region
of the gene. Alternatively, the cDNA can be used as a probe for the genomic sequence,
and the region 5' to the coding region is identified by "walking up." If the gene is
highly expressed or differentially expressed, the promoter from the gene can be of use
in a regulatory construct for a heterologous gene.
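
Once a candidate 5' flanking sequence is in hand, a scan for TATA-like motifs can be sketched in Python as follows. Only the two motifs named above (TATAA and TATTA) are searched; the function name and the example flanking sequence are invented for illustration, and the presence of such a motif is not by itself evidence of promoter activity.

```python
import re

# Lookahead so that overlapping occurrences are all reported.
TATA_MOTIF = re.compile(r"(?=(TATAA|TATTA))")


def find_tata_like(upstream):
    """Return (position, motif) for each TATAA or TATTA occurrence in a
    5' flanking sequence, using 0-based positions."""
    return [(m.start(), m.group(1))
            for m in TATA_MOTIF.finditer(upstream.upper())]


flank = "GGCCTATAAAGGCGCTATTAGC"     # invented example sequence
print(find_tata_like(flank))
```
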

Once the full-length cDNA or gene is obtained, DNA encoding variants 
can be prepared by site-directed mutagenesis, described in detail in Sambrook et al., 
15.3-15.63. The choice of codon or nucleotide to be replaced can be based on disclosure 
herein on optional changes in amino acids to achieve altered protein structure and/or
function. 

As an alternative method to obtaining DNA or RNA from a biological
material, nucleic acid comprising nucleotides having the sequence of one or more
polynucleotides of the invention can be synthesized. Thus, the invention encompasses
nucleic acid molecules ranging in length from 15 nt (corresponding to at least 15
contiguous nt of one of SEQ ID NOs:1-3351) up to a maximum length suitable for one
or more biological manipulations, including replication and expression, of the nucleic
acid molecule. The invention includes but is not limited to (a) nucleic acid having the
size of a full gene, and comprising at least one of SEQ ID NOs:1-3351; (b) the nucleic
acid of (a) also comprising at least one additional polynucleotide or gene, operably
linked to permit expression of a fusion protein; (c) an expression vector comprising (a)
or (b); (d) a plasmid comprising (a) or (b); and (e) a recombinant viral particle
comprising (a) or (b). Once provided with the polynucleotides disclosed herein,
construction or preparation of (a)-(e) is well within the skill in the art.

The sequence of a nucleic acid comprising at least 15 contiguous nt of at
least any one of SEQ ID NOs:1-3351, preferably the entire sequence of at least any one
of SEQ ID NOs:1-3351, is not limited and can be any sequence of A, T, G, and/or C
(for DNA) and A, U, G, and/or C (for RNA) or modified bases thereof, including
inosine and pseudouridine. The choice of sequence will depend on the desired function
and can be dictated by coding regions desired, the intron-like regions desired, and the
regulatory regions desired. Where the entire sequence of any one of SEQ ID NOs:1-3351
is within the nucleic acid, the nucleic acid obtained is referred to herein as a
polynucleotide comprising the sequence of any one of SEQ ID NOs:1-3351.

Expression of Polypeptide Encoded by Full-Length cDNA or Full-Length Gene

The provided polynucleotides (e.g., a polynucleotide having a sequence
of one of SEQ ID NOs:1-3351), the corresponding cDNA, or the full-length gene is
used to express a partial or complete gene product. Constructs of polynucleotides
having sequences of SEQ ID NOs:1-3351 can be generated synthetically. Alternatively,
single-step assembly of a gene and entire plasmid from large numbers of
oligodeoxyribonucleotides is described by, e.g., Stemmer et al., Gene (Amsterdam)
(1995) 164(1):49-53. In this method, assembly PCR (the synthesis of long DNA
sequences from large numbers of oligodeoxyribonucleotides (oligos)) is described. The
method is derived from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and
does not rely on DNA ligase, but instead relies on DNA polymerase to build
increasingly longer DNA fragments during the assembly process.
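
The oligo layout underlying an assembly PCR design can be illustrated in Python. The sketch below is an in-silico illustration only: it tiles a target sequence into overlapping oligos on alternating strands and then checks that the tiling covers the target. It does not model polymerase extension. The 40 nt oligo length, the 20 nt overlap, and all function names are assumptions made for this example and are not taken from the cited references.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def design_assembly_oligos(sequence, oligo_len=40, overlap=20):
    """Tile `sequence` into overlapping oligos on alternating strands.

    Even-numbered oligos are top strand; odd-numbered oligos are written as
    the reverse complement, so neighbours can anneal across the overlap.
    """
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be between 0 and oligo_len")
    step = oligo_len - overlap
    oligos = []
    for index, start in enumerate(range(0, len(sequence) - overlap, step)):
        top = sequence[start:start + oligo_len]
        oligos.append(top if index % 2 == 0 else reverse_complement(top))
    return oligos


def reassemble(oligos, overlap=20):
    """Rebuild the top strand from the tiled oligos (design check only)."""
    tops = [o if i % 2 == 0 else reverse_complement(o)
            for i, o in enumerate(oligos)]
    result = tops[0]
    for top in tops[1:]:
        result += top[overlap:]
    return result


target = ("ACGTTGCA" * 12)[:90]      # invented 90 nt example sequence
oligos = design_assembly_oligos(target)
print(len(oligos), reassemble(oligos) == target)
```
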

Appropriate polynucleotide constructs are purified using standard
recombinant DNA techniques as described in, for example, Sambrook et al., Molecular
Cloning: A Laboratory Manual, 2nd Ed. (1989) Cold Spring Harbor Press, Cold Spring
Harbor, NY, and under current regulations described in United States Dept. of HHS,
National Institutes of Health (NIH) Guidelines for Recombinant DNA Research. The
gene product encoded by a polynucleotide of the invention is expressed in any
expression system, including, for example, bacterial, yeast, insect, amphibian and
mammalian systems. Vectors, host cells and methods for obtaining expression in same
are well known in the art. Suitable vectors and host cells are described in U.S. Patent
No. 5,654,173.

Polynucleotide molecules comprising a polynucleotide sequence 
provided herein are generally propagated by placing the molecule in a vector. Viral and 
non-viral vectors are used, including plasmids. The choice of plasmid will depend on 
the type of cell in which propagation is desired and the purpose of propagation. Certain 
vectors are useful for amplifying and making large amounts of the desired DNA
sequence. Other vectors are suitable for expression in cells in culture. Still other 
vectors are suitable for transfer and expression in cells in a whole animal or person. The 
choice of appropriate vector is well within the skill of the art. Many such vectors are
available commercially. Methods for preparation of vectors comprising a desired
sequence are well known in the art.

The polynucleotides set forth in SEQ ID NOs: 1-3351 or their
corresponding full-length polynucleotides are linked to regulatory sequences as
appropriate to obtain the desired expression properties. These can include promoters
(attached either at the 5' end of the sense strand or at the 3' end of the antisense strand),
enhancers, terminators, operators, repressors, and inducers. The promoters can be
regulated or constitutive. In some situations it may be desirable to use conditionally
active promoters, such as tissue-specific or developmental stage-specific promoters.
These are linked to the desired nucleotide sequence using the techniques described
above for linkage to vectors. Any techniques known in the art can be used.

When any appropriate host cells or organisms are used to replicate
and/or express the polynucleotides or nucleic acids of the invention, the resulting
replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope of
the invention as a product of the host cell or organism. The product is recovered by any
appropriate means known in the art.

Once the gene corresponding to a selected polynucleotide is identified,
its expression can be regulated in the cell to which the gene is native. For example, an
endogenous gene of a cell can be regulated by an exogenous regulatory sequence as
disclosed in U.S. Patent No. 5,641,670.



Identification of Functional and Structural Motifs of Novel Genes 

Translations of the nucleotide sequence of the provided polynucleotides,
cDNAs or full genes can be aligned with individual known sequences. Similarity with
individual sequences can be used to determine the activity of the polypeptides encoded
by the polynucleotides of the invention. Also, sequences exhibiting similarity with
more than one individual sequence can exhibit activities that are characteristic of either
or both individual sequences.

The full length sequences and fragments of the polynucleotide sequences
of the nearest neighbors can be used as probes and primers to identify and isolate the
full length sequence corresponding to provided polynucleotides. The nearest neighbors
can indicate a tissue or cell type to be used to construct a library for the full-length
sequences corresponding to the provided polynucleotides.

Typically, a selected polynucleotide is translated in all six frames to
determine the best alignment with the individual sequences. The sequences disclosed
herein in the Sequence Listing are in a 5' to 3' orientation and translation in three
frames can be sufficient. These amino acid sequences are referred to, generally, as
query sequences, which will be aligned with the individual sequences. Databases with
individual sequences are described in "Computer Methods for Macromolecular
Sequence Analysis" Methods in Enzymology (1996) 266, Doolittle, Academic Press,
Inc., a division of Harcourt Brace & Co., San Diego, California, USA. Databases
include Genbank, EMBL, and DNA Database of Japan (DDBJ).
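
The six-frame translation described above can be sketched as follows. This is a minimal illustration only, assuming the standard genetic code and an unambiguous A/C/G/T input; the function names are hypothetical and do not correspond to any program cited in this description.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def six_frame_translations(seq):
    """Return a dict mapping frame labels (+1..+3, -1..-3) to translations."""
    frames = {}
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(s[offset:])
    return frames
```

Where a sequence is already known to be in a 5' to 3' orientation, only the three "+" frames need to be examined.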

Query and individual sequences can be aligned using the methods and
computer programs described above, which include BLAST, available over the world
wide web at http://www.ncbi.nlm.nih.gov/BLAST. Another alignment algorithm is
Fasta, available in the Genetics Computing Group (GCG) package, Madison,
Wisconsin, USA, a wholly owned subsidiary of Oxford Molecular Group, Inc. Other
techniques for alignment are described in Doolittle, supra. Preferably, an alignment
program that permits gaps in the sequence is utilized to align the sequences. The
Smith-Waterman algorithm is one type of algorithm that permits gaps in sequence
alignments. See Meth. Mol. Biol. (1997) 70:173-187. Also, the GAP program using the
Needleman and Wunsch alignment method can be utilized to align sequences. An
alternative search strategy uses MPSRCH software, which runs on a MASPAR computer.
MPSRCH uses a Smith-Waterman algorithm to score sequences on a massively parallel
computer. This approach improves the ability to identify distantly related matches,
and is especially tolerant of small gaps and nucleotide sequence errors. Amino acid
sequences encoded by the provided polynucleotides can be used to search both protein
and DNA databases.
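
A gapped local alignment score of the Smith-Waterman type can be sketched as below. The scoring values (match, mismatch, and a linear gap penalty) are illustrative assumptions, not the parameters of MPSRCH, GAP, or any other program named above, which typically use substitution matrices and affine gap penalties.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Uses a simple match/mismatch score and a linear gap penalty
    (assumed values chosen for illustration only).
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor of 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

For example, aligning "ACGTACGT" against "ACGACGT" opens a single gap so that seven residues match, which scores higher than the best ungapped local match.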

High Similarity. In general, in alignment results considered to be of high
similarity, the percent of the alignment region length is typically at least about 55% of
the total length of the query sequence; more typically, at least about 58%; even more
typically, at least about 60% of the total residue length of the query sequence. Usually,
percent length of the alignment region can be as much as about 62%; more usually, as
much as about 64%; even more usually, as much as about 66%. Further, for high
similarity, the region of alignment typically exhibits at least about 75% sequence
identity; more typically, at least about 78%; even more typically, at least about 80%
sequence identity. Usually, percent sequence identity can be as much as about 82%;
more usually, as much as about 84%; even more usually, as much as about 86%.

The p value is used in conjunction with these methods. If high similarity
is found, the query sequence is considered to have high similarity with a profile
sequence when the p value is less than or equal to about 10⁻²; more usually, less than or
equal to about 10⁻³; even more usually, less than or equal to about 10⁻⁴. More typically,
the p value is no more than about 10⁻⁵; more typically, no more than about
10⁻¹⁰; even more typically, no more than about 10⁻¹⁵ for the query sequence
to be considered high similarity.
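
The high-similarity criteria above combine three quantities: the fraction of the query covered by the alignment region, the percent identity within that region, and the p value. A minimal sketch of such a check follows, using the least stringent thresholds recited above as defaults; the function name and argument layout are hypothetical, and because the recited values are approximate ("about"), the cutoffs should be treated as adjustable.

```python
def is_high_similarity(alignment_length, query_length, percent_identity, p_value,
                       min_coverage=55.0, min_identity=75.0, max_p=1e-2):
    """Apply coverage, identity, and p-value cutoffs to one alignment result."""
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    # Percent of the query sequence spanned by the alignment region.
    coverage = 100.0 * alignment_length / query_length
    return (coverage >= min_coverage
            and percent_identity >= min_identity
            and p_value <= max_p)
```

Stricter embodiments can be expressed by passing, e.g., `min_coverage=60.0`, `min_identity=80.0`, and `max_p=1e-5`.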

Similarity Determined by Sequence Identity Alone. Sequence identity
alone can be used to determine similarity of a query sequence to an individual sequence
and can indicate the activity of the sequence. Such an alignment, preferably, permits
gaps to align sequences. Typically, the query sequence is related to the profile sequence
if the sequence identity over the entire query sequence is at least about 15%; more
typically, at least about 20%; even more typically, at least about 25%; even more
typically, at least about 50%. Sequence identity alone as a measure of similarity is most
useful when the query sequence is usually at least 80 residues in length; more usually,
at least 90 residues; even more usually, at least 95 amino acid residues in length. More
typically, similarity can be concluded based on sequence identity alone when the query
sequence is preferably 100 residues in length; more preferably, 120 residues in length;
even more preferably, 150 amino acid residues in length.
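
Percent identity over the entire query, together with the minimum query length, can be computed from a gapped pairwise alignment as sketched below. The representation of the alignment as two equal-length strings with "-" for gaps is an assumption made for illustration, as are the function names.

```python
def identity_over_query(aligned_query, aligned_subject):
    """Percent of query residues that are identical to the aligned subject residue."""
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    query_length = sum(1 for c in aligned_query if c != "-")
    if query_length == 0:
        raise ValueError("query contains no residues")
    matches = sum(1 for q, s in zip(aligned_query, aligned_subject)
                  if q == s and q != "-")
    return 100.0 * matches / query_length


def related_by_identity(aligned_query, aligned_subject,
                        min_identity=15.0, min_length=80):
    """Apply the identity and query-length cutoffs (defaults are the loosest recited)."""
    query_length = sum(1 for c in aligned_query if c != "-")
    if query_length < min_length:
        return False
    return identity_over_query(aligned_query, aligned_subject) >= min_identity
```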

Alignments with Profile and Multiple Aligned Sequences. Translations
of the provided polynucleotides can be aligned with amino acid profiles that define
either protein families or common motifs. Also, translations of the provided
polynucleotides can be aligned to multiple sequence alignments (MSA) comprising the
polypeptide sequences of members of protein families or motifs. Similarity or identity
with profile sequences or MSAs can be used to determine the activity of the gene
products (e.g., polypeptides) encoded by the provided polynucleotides or corresponding
cDNA or genes. For example, sequences that show an identity or similarity with a
chemokine profile or MSA can exhibit chemokine activities.

Profiles can be designed manually by (1) creating an MSA, which is an
alignment of the amino acid sequence of members that belong to the family and (2)
constructing a statistical representation of the alignment. Such methods are described,
for example, in Birney et al., Nucl. Acid Res. (1996) 24(14):2730-2739. MSAs of some
protein families and motifs are publicly available. MSAs are described also in
Sonnhammer et al., Proteins (1997) 28:405-420. A brief description of MSAs is
reported in Pascarella et al., Prot. Eng. (1996) 9(5):249-251. Techniques for building
profiles from MSAs are described in Sonnhammer et al., supra; Birney et al., supra;
and "Computer Methods for Macromolecular Sequence Analysis," Methods in
Enzymology (1996) 266, Doolittle, Academic Press, Inc., San Diego, California, USA.
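
One simple statistical representation of an MSA is a table of per-column residue frequencies. The sketch below builds such a table and scores an ungapped query against it. It is offered only as an illustration of steps (1) and (2) above; it is not the profile construction of Birney et al. or Sonnhammer et al., and the function names and toy alignment are hypothetical.

```python
from collections import Counter


def build_profile(msa):
    """Return one {residue: frequency} dict per alignment column ('-' = gap)."""
    if len({len(row) for row in msa}) != 1:
        raise ValueError("all MSA rows must be non-empty and of equal length")
    profile = []
    for column in zip(*msa):
        residues = [r for r in column if r != "-"]
        counts = Counter(residues)
        total = len(residues)
        profile.append({aa: n / total for aa, n in counts.items()} if total else {})
    return profile


def score_against_profile(query, profile):
    """Sum the profile frequency of the query residue at each column (no gaps)."""
    return sum(col.get(res, 0.0) for res, col in zip(query, profile))
```

With the toy alignment `["ACDE", "ACDF", "AC-E", "GCDE"]`, the query `"ACDE"` scores higher than `"GCDF"` because it carries the more frequent residue at the first and last columns.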



Similarity between a query sequence and a protein family or motif can be
determined by (a) comparing the query sequence against the profile and/or (b) aligning
the query sequence with the members of the family or motif. Typically, a program such
as Searchwise is used to compare the query sequence to the statistical representation of
the multiple alignment, also known as a profile (see Birney et al., supra). Other
techniques to compare the sequence and profile are described in Sonnhammer et al.,
supra and Doolittle, supra.

Next, methods described by Feng et al., J. Mol. Evol. (1987) 25:351 and
Higgins et al., CABIOS (1989) 5:151 can be used to align the query sequence with the
members of a family or motif, also known as a MSA. Sequence alignments can be
generated using any of a variety of software tools. Examples include PileUp, which
creates a multiple sequence alignment, and is described in Feng et al., J. Mol. Evol.
(1987) 25:351. Another method, GAP, uses the alignment method of Needleman et al.,
J. Mol. Biol. (1970) 48:443. GAP is best suited for global alignment of sequences. A
third method, BestFit, functions by inserting gaps to maximize the number of matches
using the local homology algorithm of Smith et al., Adv. Appl. Math. (1981) 2:482. In
general, the following factors are used to determine if a similarity between a query
sequence and a profile or MSA exists: (1) number of conserved residues found in the
query sequence, (2) percentage of conserved residues found in the query sequence, (3)
number of frameshifts, and (4) spacing between conserved residues.

Some alignment programs that both translate and align sequences can
make any number of frameshifts when translating the nucleotide sequence to produce
the best alignment. The fewer frameshifts needed to produce an alignment, the stronger
the similarity or identity between the query and profile or MSAs. For example, a weak
similarity resulting from no frameshifts can be a better indication of activity or structure
of a query sequence, than a strong similarity resulting from two frameshifts. Preferably,
three or fewer frameshifts are found in an alignment; more preferably two or fewer
frameshifts; even more preferably, one or fewer frameshifts; even more preferably, no
frameshifts are found in an alignment of query and profile or MSAs.

Conserved residues are those amino acids found at a particular position
in all or some of the family or motif members. Alternatively, a position is considered
conserved if only a certain class of amino acids is found in a particular position in all or
some of the family members. For example, the N-terminal position can contain a
positively charged amino acid, such as lysine, arginine, or histidine.

Typically, a residue of a polypeptide is conserved when a class of amino
acids or a single amino acid is found at a particular position in at least about 40% of all
class members; more typically, at least about 50%; even more typically, at least about
60% of the members. Usually, a residue is conserved when a class or single amino acid
is found in at least about 70% of the members of a family or motif; more usually, at
least about 80%; even more usually, at least about 90%; even more usually, at least
about 95%.

A residue is considered conserved when three unrelated amino acids are
found at a particular position in some or all of the members; more usually, two
unrelated amino acids. These residues are conserved when the unrelated amino acids
are found at particular positions in at least about 40% of all class members; more
typically, at least about 50%; even more typically, at least about 60% of the members.
Usually, a residue is conserved when a class or single amino acid is found in at least
about 70% of the members of a family or motif; more usually, at least about 80%; even
more usually, at least about 90%; even more usually, at least about 95%.

A query sequence has similarity to a profile or MSA when the query
sequence comprises at least about 25% of the conserved residues of the profile or MSA;
more usually, at least about 30%; even more usually, at least about 40%. Typically, the
query sequence has a stronger similarity to a profile sequence or MSA when the query
sequence comprises at least about 45% of the conserved residues of the profile or MSA;
more typically, at least about 50%; even more typically, at least about 55%.
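
The conserved-residue criteria above can be sketched in two steps: first mark MSA columns in which one amino acid reaches a chosen fraction of the members, then count how many of those conserved residues the aligned query carries. As a simplifying assumption, the sketch handles single amino acids only and does not model amino acid classes (e.g., positively charged residues); the function names and default cutoffs (40% and 25%, the loosest values recited) are illustrative.

```python
from collections import Counter


def conserved_positions(msa, min_fraction=0.40):
    """Return {column_index: residue} for columns with a sufficiently common residue."""
    conserved = {}
    n_members = len(msa)
    for idx, column in enumerate(zip(*msa)):
        counts = Counter(r for r in column if r != "-")
        if not counts:
            continue  # column consists only of gaps
        residue, count = counts.most_common(1)[0]
        if count / n_members >= min_fraction:
            conserved[idx] = residue
    return conserved


def conserved_fraction_in_query(aligned_query, conserved):
    """Fraction of conserved positions at which the query has the conserved residue."""
    if not conserved:
        return 0.0
    hits = sum(1 for idx, res in conserved.items()
               if idx < len(aligned_query) and aligned_query[idx] == res)
    return hits / len(conserved)


def has_similarity(aligned_query, conserved, min_fraction=0.25):
    """True when the query carries at least min_fraction of the conserved residues."""
    return conserved_fraction_in_query(aligned_query, conserved) >= min_fraction
```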

Identification of Secreted and Membrane-Bound Polypeptides 

Both secreted and membrane-bound polypeptides of the present
invention are of particular interest. For example, levels of secreted polypeptides can be
assayed in body fluids that are convenient, such as blood, plasma, serum, and other
body fluids such as urine, prostatic fluid and semen. Membrane-bound polypeptides are
useful for constructing vaccine antigens or inducing an immune response. Such
antigens would comprise all or part of the extracellular region of the membrane-bound
polypeptides. Because both secreted and membrane-bound polypeptides comprise a
fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms
can be used to identify such polypeptides.

A signal sequence is usually encoded by both secreted and membrane-bound
polypeptide genes to direct a polypeptide to the surface of the cell. The signal
sequence usually comprises a stretch of hydrophobic residues. Such signal sequences
can fold into helical structures. Membrane-bound polypeptides typically comprise at
least one transmembrane region that possesses a stretch of hydrophobic amino acids that
can traverse the membrane. Some transmembrane regions also exhibit a helical
structure. Hydrophobic fragments within a polypeptide can be identified by using
computer algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci.
USA (1981) 78:3824-3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157:105-132; and
RAOAR algorithm, Degli Esposti et al., Eur. J. Biochem. (1990) 190:207-219.

Another method of identifying secreted and membrane-bound
polypeptides is to translate the polynucleotides of the invention in all six frames and
determine if at least 8 contiguous hydrophobic amino acids are present. Those
translated polypeptides with at least 8; more typically, 10; even more typically, 12
contiguous hydrophobic amino acids are considered to be either a putative secreted or
membrane bound polypeptide. Hydrophobic amino acids include alanine, glycine,
histidine, isoleucine, leucine, lysine, methionine, phenylalanine, proline, threonine,
tryptophan, tyrosine, and valine.
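
The contiguous-hydrophobic-residue test above can be sketched as a scan for the longest run of residues drawn from the listed set. The one-letter codes below are a direct transcription of the amino acids named in the preceding paragraph; the function names are hypothetical.

```python
# One-letter codes for the hydrophobic amino acids listed in the text:
# Ala, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Thr, Trp, Tyr, Val.
HYDROPHOBIC = set("AGHILKMFPTWYV")


def longest_hydrophobic_run(protein):
    """Length of the longest stretch of contiguous hydrophobic residues."""
    best = run = 0
    for residue in protein.upper():
        run = run + 1 if residue in HYDROPHOBIC else 0
        best = max(best, run)
    return best


def is_putative_secreted_or_membrane(protein, min_run=8):
    """Apply the minimum-run criterion (8 by default; 10 or 12 are stricter)."""
    return longest_hydrophobic_run(protein) >= min_run
```

In practice this check would be applied to each translated frame of a polynucleotide, and a positive result in any frame flags the sequence for further analysis.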

Identification of the Function of an Expression Product of a Full-Length Gene 
Ribozymes, antisense constructs, and dominant negative mutants can be
used to determine function of the expression product of a gene corresponding to a
polynucleotide provided herein. The phosphoramidite method of oligonucleotide
synthesis can be used to construct antisense molecules and ribozymes. See Beaucage et
al., Tet. Lett. (1981) 22:1859 and U.S. Patent No. 4,668,777. Automated devices for
synthesis are available to create oligonucleotides using this chemistry. Examples of
such devices include Biosearch 8600, Models 392 and 394 by Applied Biosystems, a
division of Perkin-Elmer Corp., Foster City, California, USA; and Expedite by
Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, phosphate
analog oligonucleotides, and chemically derivatized oligonucleotides can also be
produced, and can be covalently attached to other molecules. RNA oligonucleotides
can be synthesized, for example, using RNA phosphoramidites. This method can be
performed on an automated synthesizer, such as Applied Biosystems, Models 392 and
394, Foster City, California, USA.
Oligonucleotides of up to 200 nt can be synthesized, more typically, 100
nt, more typically 50 nt; even more typically 30 to 40 nt. These synthetic fragments can
be annealed and ligated together to construct larger fragments. See, for example,
Sambrook et al., supra. Trans-cleaving catalytic RNAs (ribozymes) are RNA
molecules possessing endoribonuclease activity. Ribozymes are specifically designed
for a particular target, and the target message must contain a specific nucleotide
sequence. They are engineered to cleave any RNA species site-specifically in the
background of cellular RNA. The cleavage event renders the mRNA unstable and
prevents protein expression. Importantly, ribozymes can be used to inhibit expression
of a gene of unknown function for the purpose of determining its function in an in vitro
or in vivo context, by detecting the phenotypic effect.

Antisense nucleic acids are designed to specifically bind to RNA,
resulting in the formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA
replication, reverse transcription or messenger RNA translation. Antisense
polynucleotides based on a selected polynucleotide sequence can interfere with
expression of the corresponding gene. Antisense polynucleotides are typically
generated within the cell by expression from antisense constructs that contain the
antisense strand as the transcribed strand. Antisense polynucleotides based on the
disclosed polynucleotides will bind and/or interfere with the translation of mRNA
comprising a sequence complementary to the antisense polynucleotide. The expression
products of control cells and cells treated with the antisense construct are compared to
detect the protein product of the gene corresponding to the polynucleotide upon which
the antisense construct is based. The protein is isolated and identified using routine
biochemical methods.
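
Because an antisense polynucleotide is complementary to its target mRNA, a candidate antisense RNA sequence is simply the reverse complement of the targeted region. A minimal sketch follows, assuming perfect Watson-Crick complementarity and RNA written 5' to 3'; it says nothing about binding efficiency in a cell, and the function names are hypothetical.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(rna):
    """Return the reverse complement of an RNA sequence, written 5' to 3'."""
    return rna.upper().translate(RNA_COMPLEMENT)[::-1]


def targets_mrna(antisense, mrna):
    """True if the antisense RNA is perfectly complementary to a region of mrna."""
    return reverse_complement_rna(antisense) in mrna.upper()
```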

Given the extensive background literature and clinical experience in
antisense therapy, one skilled in the art can use selected polynucleotides of the
invention as additional potential therapeutics. The choice of polynucleotide can be
narrowed by first testing them for binding to "hot spot" regions of the genome of
cancerous cells. If a polynucleotide is identified as binding to a "hot spot," testing the
polynucleotide as an antisense compound in the corresponding cancer cells is
warranted.

Dominant negative mutations also are readily generated for
corresponding proteins that are active as homomultimers. A mutant polypeptide will
interact with wild-type polypeptides (made from the other allele) and form a
non-functional multimer. Thus, a mutation is in a substrate-binding domain, a catalytic
domain, or a cellular localization domain. Preferably, the mutant polypeptide will be
overproduced. Point mutations are made that have such an effect. In addition, fusion of
different polypeptides of various lengths to the terminus of a protein can yield dominant
negative mutants. General strategies are available for making dominant negative
mutants (see, e.g., Herskowitz, Nature (1987) 329:219). Such techniques can be used to
create loss of function mutations, which are useful for determining protein function.

Polypeptides and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed
polynucleotides, as well as those encoded by nucleic acids that, by virtue of the
degeneracy of the genetic code, are not identical in sequence to the disclosed
polynucleotides. Thus, the invention includes within its scope a polypeptide encoded
by a polynucleotide having the sequence of any one of SEQ ID NOs: 1-3351 or a
variant thereof.

In general, the term "polypeptide" as used herein refers to both the full
length polypeptide encoded by the recited polynucleotide, the polypeptide encoded by
the gene represented by the recited polynucleotide, as well as portions or fragments
thereof. "Polypeptides" also includes variants of the naturally occurring proteins, where
such variants are homologous or substantially similar to the naturally occurring protein,
and can be of an origin of the same or different species as the naturally occurring
protein (e.g., human, murine, or some other species that naturally expresses the recited
polypeptide, usually a mammalian species). In general, variant polypeptides have a
sequence that has at least about 80%, usually at least about 90%, and more usually at
least about 98% sequence identity with a differentially expressed polypeptide of the
invention, as measured by BLAST using the parameters described above. The variant
polypeptides can be naturally or non-naturally glycosylated, i.e., the polypeptide has a
glycosylation pattern that differs from the glycosylation pattern found in the
corresponding naturally occurring protein.

The invention also encompasses homologs of the disclosed polypeptides
(or fragments thereof) where the homologs are isolated from other species, i.e., other
animal or plant species, usually mammalian species, e.g.,
rodents, such as mice and rats; domestic animals, e.g., horse, cow, dog, cat; and humans.
By "homolog" is meant a polypeptide having at least about 35%, usually at least about
40% and more usually at least about 60% amino acid sequence identity to a particular
differentially expressed protein as identified above, where sequence identity is
determined using the BLAST algorithm, with the parameters described above.

In general, the polypeptides of the subject invention are provided in a
non-naturally occurring environment, e.g., are separated from their naturally occurring
environment. In certain embodiments, the subject protein is present in a composition
that is enriched for the protein as compared to a control. As such, purified polypeptide
is provided, where by purified is meant that the protein is present in a composition that
is substantially free of non-differentially expressed polypeptides, where by substantially
free is meant that less than 90%, usually less than 60% and more usually less than 50%
of the composition is made up of non-differentially expressed polypeptides.



Also within the scope of the invention are variants; variants of
polypeptides include mutants, fragments, and fusions. Mutants can include amino acid
substitutions, additions or deletions. The amino acid substitutions can be conservative
amino acid substitutions or substitutions to eliminate non-essential amino acids, such as
to alter a glycosylation site, a phosphorylation site or an acetylation site, or to minimize
misfolding by substitution or deletion of one or more cysteine residues that are not
necessary for function. Conservative amino acid substitutions are those that preserve
the general charge, hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid
substituted. Variants can be designed so as to retain biological activity of a particular
region of the protein (e.g., a functional domain and/or, where the polypeptide is a
member of a protein family, a region associated with a consensus sequence). Selection
of amino acid alterations for production of variants can be based upon the accessibility
(interior vs. exterior) of the amino acid (see, e.g., Go et al., Int. J. Peptide Protein Res.
(1980) 15:211), the thermostability of the variant polypeptide (see, e.g., Querol et al.,
Prot. Eng. (1996) 9:265), desired glycosylation sites (see, e.g., Olsen and Thomsen, J.
Gen. Microbiol. (1991) 137:579), desired disulfide bridges (see, e.g., Clarke et al.,
Biochemistry (1993) 32:4322; and Wakarchuk et al., Protein Eng. (1994) 7:1379),
desired metal binding sites (see, e.g., Toma et al., Biochemistry (1991) 30:97, and
Haezerbrouck et al., Protein Eng. (1993) 6:643), and desired substitutions within
proline loops (see, e.g., Masul et al., Appl. Env. Microbiol. (1994) 60:3579).
Cysteine-depleted muteins can be produced as disclosed in U.S. Patent No. 4,959,314.

Variants also include fragments of the polypeptides disclosed herein,
particularly biologically active fragments and/or fragments corresponding to functional
domains. Fragments of interest will typically be at least about 10 aa to at least about 15
aa in length, usually at least about 50 aa in length, and can be as long as 300 aa in length
or longer, but will usually not exceed about 1000 aa in length, where the fragment will
have a stretch of amino acids that is identical to a polypeptide encoded by a
polynucleotide having a sequence of any SEQ ID NOs: 1-3351, or a homolog thereof.
The protein variants described herein are encoded by polynucleotides that are within the
scope of the invention. The genetic code can be used to select the appropriate codons to
construct the corresponding variants.

Computer-Related Embodiments

In general, a library of polynucleotides is a collection of sequence
information, which information is provided in either biochemical form (e.g., as a
collection of polynucleotide molecules), or in electronic form (e.g., as a collection of
polynucleotide sequences stored in a computer-readable form, as in a computer system
and/or as part of a computer program). The sequence information of the
polynucleotides can be used in a variety of ways, e.g., as a resource for gene discovery,
as a representation of sequences expressed in a selected cell type (e.g., cell type
markers), and/or as markers of a given disease or disease state. In general, a disease
marker is a representation of a gene product that is present in all cells affected by
disease either at an increased or decreased level relative to a normal cell (e.g., a cell of
the same or similar type that is not substantially affected by disease). For example, a
polynucleotide sequence in a library can be a polynucleotide that represents an mRNA,
polypeptide, or other gene product encoded by the polynucleotide, that is either
overexpressed or underexpressed in a breast ductal cell affected by cancer relative to a
normal (i.e., substantially disease-free) breast cell.

The nucleotide sequence information of the library can be embodied in
any suitable form, e.g., electronic or biochemical forms. For example, a library of
sequence information embodied in electronic form comprises an accessible computer
data file (or, in biochemical form, a collection of nucleic acid molecules) that contains
the representative nucleotide sequences of genes that are differentially expressed (e.g.,
overexpressed or underexpressed) as between, for example, i) a cancerous cell and a
normal cell; ii) a cancerous cell and a dysplastic cell; iii) a cancerous cell and a cell
affected by a disease or condition other than cancer; iv) a metastatic cancerous cell and
a normal cell and/or non-metastatic cancerous cell; v) a malignant cancerous cell and a
non-malignant cancerous cell (or a normal cell) and/or vi) a dysplastic cell relative to a
normal cell. Other combinations and comparisons of cells affected by various diseases
or stages of disease will be readily apparent to the ordinarily skilled artisan.
Biochemical embodiments of the library include a collection of nucleic acids that have
the sequences of the genes in the library, where the nucleic acids can correspond to the
entire gene in the library or to a fragment thereof, as described in greater detail below.



The polynucleotide libraries of the subject invention generally comprise
sequence information of a plurality of polynucleotide sequences, where at least one of
the polynucleotides has a sequence of any of SEQ ID NOs: 1-3351. By plurality is
meant at least 2, usually at least 3 and can include up to all of SEQ ID NOs: 1-3351.
The length and number of polynucleotides in the library will vary with the nature of the
library, e.g., if the library is an oligonucleotide array, a cDNA array, a computer
database of the sequence information, etc.

Where the library is an electronic library, the nucleic acid sequence
information can be present in a variety of media. "Media" refers to a manufacture,
other than an isolated nucleic acid molecule, that contains the sequence information of
the present invention. Such a manufacture provides the genome sequence or a subset
thereof in a form that can be examined by means not directly applicable to the sequence
as it exists in a nucleic acid. For example, the nucleotide sequence of the present
invention, e.g., the nucleic acid sequences of any of the polynucleotides of SEQ ID
NOs: 1-3351, can be recorded on computer readable media, e.g., any medium that can be
read and accessed directly by a computer. Such media include, but are not limited to:
magnetic storage media, such as a floppy disc, a hard disc storage medium, and a
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as
RAM and ROM; and hybrids of these categories such as magnetic/optical storage
media. One of skill in the art can readily appreciate how any of the presently known
computer readable mediums can be used to create a manufacture comprising a recording
of the present sequence information. "Recorded" refers to a process for storing
information on computer readable medium, using any such methods as known in the art.
Any convenient data storage structure can be chosen, based on the means used to access
the stored information. A variety of data processor programs and formats can be used
for storage, e.g., word processing text file, database format, etc. In addition to the
sequence information, electronic versions of the libraries of the invention can be
provided in conjunction or connection with other computer-readable information and/or
other types of computer-readable files (e.g., searchable files, executable files, etc.,
including, but not limited to, for example, search program software, etc.).

By providing the nucleotide sequence in computer readable form, the
information can be accessed for a variety of purposes. Computer software to access
sequence information is publicly available. For example, the BLAST (Altschul et al.,
supra) and BLAZE (Brutlag et al., Comp. Chem. (1993) 17:203) search algorithms on a
Sybase system can be used to identify open reading frames (ORFs) within the genome
that contain homology to ORFs from other organisms.

As used herein, "a computer-based system" refers to the hardware
means, software means, and data storage means used to analyze the nucleotide sequence
information of the present invention. The minimum hardware of the computer-based
systems of the present invention comprises a central processing unit (CPU), input
means, output means, and data storage means. A skilled artisan can readily appreciate
that any one of the currently available computer-based systems is suitable for use in
the present invention. The data storage means can comprise any manufacture comprising a
recording of the present sequence information as described above, or a memory access
means that can access such a manufacture.

"Search means" refers to one or more programs implemented on the 
computer-based system, to compare a target sequence or target structural motif, or
expression levels of a polynucleotide in a sample, with the stored sequence information.
Search means can be used to identify fragments or regions of the genome that match a
particular target sequence or target motif. A variety of algorithms are publicly
known and commercially available, e.g., MacPattern (EMBL), BLASTN and BLASTX
(NCBI). A "target sequence" can be any polynucleotide or amino acid sequence of six
or more contiguous nucleotides or two or more amino acids, preferably from about 10
to 100 amino acids or from about 30 to 300 nt. A variety of comparing means can be
used to accomplish comparison of sequence information from a sample (e.g., to analyze
target sequences, target motifs, or relative expression levels) with the data storage
means. A skilled artisan can readily recognize that any one of the publicly available
homology search programs can be used as the search means for the computer based
systems of the present invention to accomplish comparison of target sequences and
motifs. Computer programs to analyze expression levels in a sample and in controls are
also known in the art.
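
As a rough illustration of a search means, the sketch below compares a target sequence
with stored sequence records and reports where it occurs on either strand. It is a
minimal exact-match scan, not BLASTN or any other program named in this description.
The function names, record identifiers, and sequences are hypothetical, and the
six-nucleotide minimum simply mirrors the definition of a target sequence given above.

```python
# Minimal sketch of a "search means": exact-match scan of stored sequences.
# All names and example sequences are hypothetical illustrations.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_matches(target, library, min_len=6):
    """Yield (record_id, strand, position) for each exact occurrence of target.

    `library` maps a record identifier to its stored nucleotide sequence.
    Positions are 0-based offsets into the stored sequence as written.
    """
    target = target.upper()
    if len(target) < min_len:
        raise ValueError("target must be at least %d nt long" % min_len)
    queries = (("+", target), ("-", reverse_complement(target)))
    for record_id, stored in library.items():
        stored = stored.upper()
        for strand, query in queries:
            start = stored.find(query)
            while start != -1:
                yield record_id, strand, start
                start = stored.find(query, start + 1)


# Hypothetical stored sequence information keyed by record identifier.
stored_sequences = {
    "record_1": "ATGGCCATTGTAATGGGCCGC",
    "record_2": "TTTGCGGCCCATTACAATGGA",
}
hits = list(find_matches("GGCCAT", stored_sequences))
```

A homology search program would additionally score inexact alignments; the exact-match
scan is used here only because it keeps the comparison step easy to follow.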

A "target structural motif," or "target motif," refers to any rationally
selected sequence or combination of sequences in which the sequence(s) are chosen
based on a three-dimensional configuration that is formed upon the folding of the target
motif, or on consensus sequences of regulatory or active sites. There are a variety of
target motifs known in the art. Protein target motifs include, but are not limited to,
enzyme active sites and signal sequences. Nucleic acid target motifs include, but are
not limited to, hairpin structures, promoter sequences and other expression elements
such as binding sites for transcription factors.

A variety of structural formats for the input and output means can be
used to input and output the information in the computer-based systems of the present
invention. One format for an output means ranks the relative expression levels of
different polynucleotides. Such presentation provides a skilled artisan with a ranking of
relative expression levels to determine a gene expression profile.
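
One possible form of such a ranked output is sketched below. This is only an
illustration under stated assumptions: the function name, the polynucleotide
identifiers, and the signal values are hypothetical, and expressing each level relative
to the strongest signal is one normalization choice among many.

```python
# Minimal sketch: rank polynucleotides by relative expression level.
# The identifiers and signal values below are hypothetical.

def rank_expression(levels):
    """Return (rank, polynucleotide_id, relative_level) tuples, highest first.

    `levels` maps an identifier to a measured signal. Each level is reported
    relative to the strongest signal in the profile.
    """
    if not levels:
        return []
    top = max(levels.values())
    if top <= 0:
        raise ValueError("at least one expression level must be positive")
    ordered = sorted(levels.items(), key=lambda item: item[1], reverse=True)
    return [
        (rank, name, value / top)
        for rank, (name, value) in enumerate(ordered, start=1)
    ]


profile = rank_expression(
    {"polynucleotide_A": 120.0, "polynucleotide_B": 480.0, "polynucleotide_C": 60.0}
)
```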

As discussed above, the "library" of the invention also encompasses
biochemical libraries of the polynucleotides of SEQ ID NOs: 1-3351, e.g., collections of
nucleic acids representing the provided polynucleotides. The biochemical libraries can
take a variety of forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably
associated with a surface of a solid support (i.e., an array) and the like. Of particular
interest are nucleic acid arrays in which one or more of SEQ ID NOs: 1-3351 is
represented on the array. By array is meant an article of manufacture that has at least a
substrate with at least two distinct nucleic acid targets on one of its surfaces, where the
number of distinct nucleic acids can be considerably higher, typically being at least 10
nt, usually at least 20 nt and often at least 25 nt. A variety of different array formats
have been developed and are known to those of skill in the art. The arrays of the subject
invention find use in a variety of applications, including gene expression analysis, drug
screening, mutation analysis and the like, as disclosed in the above-listed exemplary
patent documents.

In addition to the above nucleic acid libraries, analogous libraries of
polypeptides are also provided, where the polypeptides of the library will
represent at least a portion of the polypeptides encoded by SEQ ID NOs: 1-3351.

Use of Polynucleotide Probes in Mapping, and in Tissue Profiling

Polynucleotide probes, generally comprising at least 12 contiguous nt of
a polynucleotide as shown in the Sequence Listing, are used for a variety of purposes,
such as chromosome mapping of the polynucleotide and detection of transcription
levels. Additional disclosure about preferred regions of the disclosed polynucleotide
sequences is found in the Examples. A probe that hybridizes specifically to a
polynucleotide disclosed herein should provide a detection signal at least 5-, 10-, or
20-fold higher than the background hybridization provided with other unrelated sequences.
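
The fold-over-background criterion can be expressed as a simple ratio check. The sketch
below is an illustration only: the function name and the signal values are
hypothetical, and the 5-, 10-, and 20-fold tiers are taken from the sentence above.

```python
# Minimal sketch: which fold-over-background tier does a probe signal meet?
# Function name and example values are hypothetical.

def specificity_tier(signal, background, tiers=(5, 10, 20)):
    """Return the highest fold threshold met by signal/background, or None."""
    if background <= 0:
        raise ValueError("background must be positive")
    fold = signal / background
    met = [tier for tier in tiers if fold >= tier]
    return max(met) if met else None


tier_for_probe = specificity_tier(signal=1200.0, background=100.0)
```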

Detection of Expression Levels. Nucleotide probes are used to detect
expression of a gene corresponding to the provided polynucleotide. In Northern blots,
mRNA is separated electrophoretically and contacted with a probe. A probe is detected
as hybridizing to an mRNA species of a particular size. The amount of hybridization is
quantitated to determine relative amounts of expression, for example under a particular
condition. Probes are used for in situ hybridization to cells to detect expression. Probes
can also be used in vivo for diagnostic detection of hybridizing sequences. Probes are
typically labeled with a radioactive isotope. Other types of detectable labels can be
used such as chromophores, fluors, and enzymes. Other examples of nucleotide
hybridization assays are described in WO92/02526 and U.S. Patent No. 5,124,246.

Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting small
amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol. (1987) 155:335;
U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer polynucleotides
that hybridize with the target nucleic acids are used to prime the reaction. The primers
can be composed of sequence within or 3' and 5' to the polynucleotides of the Sequence
Listing. Alternatively, if the primers are 3' and 5' to these polynucleotides, they need
not hybridize to them or the complements. After amplification of the target with a
thermostable polymerase, the amplified target nucleic acids can be detected by methods
known in the art, e.g., Southern blot. mRNA or cDNA can also be detected by traditional
blotting techniques (e.g., Southern blot, Northern blot, etc.) described in Sambrook et
al., "Molecular Cloning: A Laboratory Manual" (New York, Cold Spring Harbor Laboratory,
1989) (e.g., without PCR amplification). In general, mRNA or cDNA generated from mRNA
using a polymerase enzyme can be purified and separated using gel electrophoresis, and
transferred to a solid support, such as nitrocellulose. The solid support is exposed to
a labeled probe, washed to remove any unhybridized probe, and duplexes containing the
labeled probe are detected.

Mapping. Polynucleotides of the present invention can be used to
identify a chromosome on which the corresponding gene resides. Such mapping can be
useful in identifying the function of the polynucleotide-related gene by its proximity to
other genes with known function. Function can also be assigned to the
polynucleotide-related gene when particular syndromes or diseases map to the same
chromosome. For example, use of polynucleotide probes in identification and
quantification of nucleic acid sequence aberrations is described in U.S. Patent No.
5,783,387. An exemplary mapping method is fluorescence in situ hybridization (FISH),
which facilitates comparative genomic hybridization to allow total genome assessment of
changes in relative copy number of DNA sequences (see, e.g., Valdes et al., Methods in
Molecular Biology (1997) 68:1). Polynucleotides can also be mapped to particular
chromosomes using, for example, radiation hybrids or chromosome-specific hybrid panels.
See Leach et al., Advances in Genetics, (1995) 35:63-99; Walter et al., Nature Genetics
(1994) 7:22; Walter and Goodfellow, Trends in Genetics (1992) 9:352. Panels for radiation
hybrid mapping are available from Research Genetics, Inc., Huntsville, Alabama, USA.
The statistical program RHMAP can be used to construct a map based on the data from
radiation hybridization with a measure of the relative likelihood of one order versus
another. RHMAP is available via the world wide web at
http://www.sph.umich.edu/group/statgen/software. In addition, commercial programs are
available for identifying regions of chromosomes commonly associated with disease, such
as cancer.

Tissue Typing or Profiling. Expression of specific mRNA
corresponding to the provided polynucleotides can vary in different cell types and can
be tissue-specific. This variation of mRNA levels in different cell types can be
exploited with nucleic acid probe assays to determine tissue types. For example, PCR,
branched DNA probe assays, or blotting techniques utilizing nucleic acid probes
substantially identical or complementary to polynucleotides listed in the Sequence
Listing can determine the presence or absence of the corresponding cDNA or mRNA.

Tissue typing can be used to identify the developmental organ or tissue
source of a metastatic lesion by identifying the expression of a particular marker of that
organ or tissue. If a polynucleotide is expressed only in a specific tissue type, and a
metastatic lesion is found to express that polynucleotide, then the developmental source
of the lesion has been identified. Expression of a particular polynucleotide can be
assayed by detection of either the corresponding mRNA or the protein product.

Use of Polymorphisms. A polynucleotide of the invention can be used in
forensics, genetic analysis, mapping, and diagnostic applications where the
corresponding region of a gene is polymorphic in the human population. Any means for
detecting a polymorphism in a gene can be used, including, but not limited to,
electrophoresis of protein polymorphic variants, differential sensitivity to restriction
enzyme cleavage, and hybridization to allele-specific probes.

Antibody Production

Expression products of a polynucleotide of the invention, as well as the
corresponding mRNA, cDNA, or complete gene, can be prepared and used for raising
antibodies for experimental, diagnostic, and therapeutic purposes. For polynucleotides
to which a corresponding gene has not been assigned, this provides an additional
method of identifying the corresponding gene. The polynucleotide or related cDNA is
expressed as described above, and antibodies are prepared. These antibodies are
specific to an epitope on the polypeptide encoded by the polynucleotide, and can
precipitate or bind to the corresponding native protein in a cell or tissue preparation or
in a cell-free extract of an in vitro expression system.



Methods for production of monoclonal and polyclonal antibodies that specifically bind a
selected antigen are well known in the art. The antibodies specifically bind to epitopes
present in the polypeptides encoded by polynucleotides disclosed in the Sequence
Listing. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form
an epitope. Epitopes that involve non-contiguous amino acids may require a longer
polypeptide, e.g., at least 15, 25, or 50 amino acids. Antibodies that specifically bind
to human polypeptides encoded by the provided polynucleotides should provide a
detection signal at least 5-, 10-, or 20-fold higher than a detection signal provided
with other proteins when used in Western blots or other immunochemical assays.
Preferably, antibodies that specifically bind polypeptides of the invention do not bind
to other proteins in immunochemical assays at detectable levels and can
immunoprecipitate the specific polypeptide from solution.

The invention also contemplates naturally occurring antibodies specific for a
polypeptide of the invention. For example, serum antibodies to a polypeptide of the
invention in a human population can be purified by methods well known in the art,
e.g., by passing antiserum over a column to which the corresponding selected
polypeptide or fusion protein is bound. The bound antibodies can then be eluted from
the column, for example using a buffer with a high salt concentration.

In addition to the antibodies discussed above, the invention also contemplates
genetically engineered antibodies, antibody derivatives (e.g., single chain antibodies,
antibody fragments (e.g., Fab, etc.)), according to methods well known in the art.

Other embodiments of the present invention include humanized monoclonal antibodies
capable of binding to the polypeptides of the invention. The phrase "humanized antibody"
refers to an antibody derived from a non-human antibody, typically a mouse monoclonal
antibody. Alternatively, a humanized antibody may be derived from a chimeric antibody
that retains or substantially retains the antigen-binding properties of the parental,
non-human, antibody but which exhibits diminished immunogenicity as compared to the
parental antibody when administered to humans. The phrase "chimeric antibody," as used
herein, refers to an antibody containing sequence derived from two different antibodies
(see, e.g., U.S. Patent No. 4,816,567) which typically originate from different species.
Most typically, chimeric antibodies comprise human and murine antibody fragments,
generally human constant and mouse variable regions.

Because humanized antibodies are far less immunogenic in humans than
the parental mouse monoclonal antibodies, they can be used for the treatment of humans
with far less risk of anaphylaxis. Thus, these antibodies may be preferred in therapeutic
applications that involve in vivo administration to a human such as, e.g., use as radiation
sensitizers for the treatment of neoplastic disease or use in methods to reduce the side
effects of, e.g., cancer therapy.

Humanized antibodies may be achieved by a variety of methods
including, for example: (1) grafting the non-human complementarity determining
regions (CDRs) onto a human framework and constant region (a process referred to in
the art as "humanizing"), or, alternatively, (2) transplanting the entire non-human
variable domains, but "cloaking" them with a human-like surface by replacement of
surface residues (a process referred to in the art as "veneering"). In the present
invention, humanized antibodies will include both "humanized" and "veneered"
antibodies. These methods are disclosed in, e.g., Jones et al., Nature 321:522-525
(1986); Morrison et al., Proc. Natl. Acad. Sci., U.S.A., 81:6851-6855 (1984); Morrison
and Oi, Adv. Immunol., 44:65-92 (1988); Verhoeyen et al., Science 239:1534-1536
(1988); Padlan, Molec. Immun. 28:489-498 (1991); Padlan, Molec. Immunol. 31(3):169-217
(1994); and Kettleborough, C.A. et al., Protein Eng. 4(7):773-83 (1991), each of
which is incorporated herein by reference.

The phrase "complementarity determining region" refers to amino acid
sequences which together define the binding affinity and specificity of the natural Fv
region of a native immunoglobulin binding site. See, e.g., Chothia et al., J. Mol. Biol.
196:901-917 (1987); Kabat et al., U.S. Dept. of Health and Human Services NIH
Publication No. 91-3242 (1991). The phrase "constant region" refers to the portion of
the antibody molecule that confers effector functions. In the present invention, mouse
constant regions are substituted by human constant regions. The constant regions of the
subject humanized antibodies are derived from human immunoglobulins. The heavy
chain constant region can be selected from any of the five isotypes: alpha, delta,
epsilon, gamma or mu.

One method of humanizing antibodies comprises aligning the non-human heavy and light
chain sequences to human heavy and light chain sequences, selecting and replacing the
non-human framework with a human framework based on such alignment, molecular modeling
to predict the conformation of the humanized sequence, and comparing to the
conformation of the parent antibody. This process is followed by repeated back mutation
of residues in the CDR region which disturb the structure of the CDRs until the
predicted conformation of the humanized sequence model closely approximates the
conformation of the non-human CDRs of the parent non-human antibody. Such humanized
antibodies may be further derivatized to facilitate uptake and clearance, e.g., via
Ashwell receptors. See, e.g., U.S. Patent Nos. 5,530,101 and 5,585,089, which patents
are incorporated herein by reference.

Humanized antibodies can also be produced using transgenic animals
that are engineered to contain human immunoglobulin loci. For example, WO
98/24893 discloses transgenic animals having a human Ig locus wherein the animals do
not produce functional endogenous immunoglobulins due to the inactivation of
endogenous heavy and light chain loci. WO 91/10741 also discloses transgenic
non-primate mammalian hosts capable of mounting an immune response to an immunogen,
wherein the antibodies have primate constant and/or variable regions, and wherein the
endogenous immunoglobulin-encoding loci are substituted or inactivated. WO
96/30498 discloses the use of the Cre/Lox system to modify the immunoglobulin locus
in a mammal, such as to replace all or a portion of the constant or variable region to
form a modified antibody molecule. WO 94/02602 discloses non-human mammalian
hosts having inactivated endogenous Ig loci and functional human Ig loci. U.S. Patent
No. 5,939,598 discloses methods of making transgenic mice in which the mice lack
endogenous heavy chains, and express an exogenous immunoglobulin locus comprising
one or more xenogeneic constant regions.

Using a transgenic animal described above, an immune response can be
produced to a selected antigenic molecule, and antibody-producing cells can be
removed from the animal and used to produce hybridomas that secrete human
monoclonal antibodies. Immunization protocols, adjuvants, and the like are known in
the art, and are used in immunization of, for example, a transgenic mouse as described
in WO 96/33735. This publication discloses monoclonal antibodies against a variety of
antigenic molecules including IL-6, IL-8, TNFα, human CD4, L-selectin, gp39, and
tetanus toxin. The monoclonal antibodies can be tested for the ability to inhibit or
neutralize the biological activity or physiological effect of the corresponding protein.
WO 96/33735 discloses that monoclonal antibodies against IL-8, derived from immune
cells of transgenic mice immunized with IL-8, blocked IL-8-induced functions of
neutrophils. Human monoclonal antibodies with specificity for the antigen used to
immunize transgenic animals are also disclosed in WO 96/34096.

Polynucleotides or Arrays for Diagnostics 

Polynucleotide arrays are created by spotting polynucleotide probes onto
a substrate (e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or array having
bound probes. The probes can be bound to the substrate by either covalent bonds or by
non-specific interactions, such as hydrophobic interactions. Samples of polynucleotides
can be detectably labeled (e.g., using radioactive or fluorescent labels) and then
hybridized to the probes. Double stranded polynucleotides, comprising the labeled
sample polynucleotides bound to probe polynucleotides, can be detected once the
unbound portion of the sample is washed away. Techniques for constructing arrays and
methods of using these arrays are described in EP 799 897; WO 97/29212; WO
97/27317; EP 785 280; WO 97/02357; U.S. Patent No. 5,593,839; U.S. Patent No.
5,578,832; EP 728 520; U.S. Patent No. 5,599,695; EP 721 016; U.S. Patent No.
5,556,752; WO 95/22058; and U.S. Patent No. 5,631,734. Arrays can be used to, for
example, examine differential expression of genes and can be used to determine gene
function. For example, arrays can be used to detect differential expression of a
polynucleotide between a test cell and control cell (e.g., cancer cells and normal cells).
For example, high expression of a particular message in a cancer cell, which is not
observed in a corresponding normal cell, can indicate a cancer specific gene product.
Exemplary uses of arrays are further described in, for example, Pappalardo et al., Sem.
Radiation Oncol. (1998) 8:217; and Ramsay, Nature Biotechnol. (1998) 16:40.

Differential Expression in Diagnosis 

The polynucleotides of the invention can also be used to detect
differences in expression levels between two cells, e.g., as a method to identify
abnormal or diseased tissue in a human. For polynucleotides corresponding to profiles
of protein families, the choice of tissue can be selected according to the putative
biological function. In general, the expression of a gene corresponding to a specific
polynucleotide is compared between a first tissue that is suspected of being diseased
and a second, normal tissue of the human. The tissue suspected of being abnormal or
diseased can be derived from a different tissue type of the human, but preferably it is
derived from the same tissue type; for example an intestinal polyp or other abnormal
growth should be compared with normal intestinal tissue. The normal tissue can be the
same tissue as that of the test sample, or any normal tissue of the patient, especially
those that express the polynucleotide-related gene of interest (e.g., brain, thymus, testis,
heart, prostate, placenta, spleen, small intestine, skeletal muscle, pancreas, and the
mucosal lining of the colon). A difference between the polynucleotide-related gene,
mRNA, or protein in the two tissues which are compared, for example in molecular
weight, amino acid or nucleotide sequence, or relative abundance, indicates a change in
the gene, or a gene which regulates it, in the tissue of the human that was suspected of
being diseased. Examples of detection of differential expression and its use in diagnosis
of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125.

A genetic predisposition to disease in a human can also be detected by
comparing expression levels of an mRNA or protein corresponding to a polynucleotide
of the invention in a fetal tissue with levels associated with normal fetal tissue. Fetal
tissues that are used for this purpose include, but are not limited to, amniotic fluid,
chorionic villi, blood, and the blastomere of an in vitro-fertilized embryo. The
comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA
or protein is obtained from a normal tissue of a human in which the
polynucleotide-related gene is expressed. Differences such as alterations in the
nucleotide sequence or size of the same product of the fetal polynucleotide-related gene
or mRNA, or alterations in the molecular weight, amino acid sequence, or relative
abundance of fetal protein, can indicate a germline mutation in the
polynucleotide-related gene of the fetus, which indicates a genetic predisposition to
disease. In general, diagnostic, prognostic, and other methods of the invention based on
differential expression involve detection of a level or amount of a gene product,
particularly a differentially expressed gene product, in a test sample obtained from a
patient suspected of having or being susceptible to a disease (e.g., breast cancer, lung
cancer, colon cancer and/or metastatic forms thereof), and comparing the detected levels
to those levels found in normal cells (e.g., cells substantially unaffected by cancer)
and/or other control cells (e.g., to differentiate a cancerous cell from a cell affected
by dysplasia). Furthermore, the severity of the disease can be assessed by comparing the
detected levels of a differentially expressed gene product with those levels detected in
samples representing the levels of differentially expressed gene product associated with
varying degrees of severity of disease. It should be noted that use of the term
"diagnostic" herein is not necessarily meant to exclude "prognostic" or "prognosis," but
rather is used as a matter of convenience.

The term "differentially expressed gene" is generally intended to
encompass a polynucleotide that can, for example, include an open reading frame
encoding a gene product (e.g., a polypeptide), and/or introns of such genes and adjacent
5' and 3' non-coding nucleotide sequences involved in the regulation of expression, up
to about 20 kb beyond the coding region, but possibly further in either direction. The
gene can be introduced into an appropriate vector for extrachromosomal maintenance or
for integration into a host genome. In general, a difference in expression level
associated with a decrease in expression level of at least about 25%, usually at least
about 50% to 75%, more usually at least about 90% or more is indicative of a
differentially expressed gene of interest, i.e., a gene that is underexpressed or
down-regulated in the test sample relative to a control sample. Furthermore, a difference
in expression level associated with an increase in expression of at least about 25%,
usually at least about 50% to 75%, more usually at least about 90%, and which can be at
least about 1½-fold, usually at least about 2-fold to about 10-fold, and can be about
100-fold to about 1,000-fold relative to a control sample, is indicative of a
differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene.
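
The percentage thresholds above can be applied as a fractional change relative to the
control. The sketch below is a minimal illustration, not a prescribed assay: the
function name, the "unchanged" label, and the example values are hypothetical, and the
default cutoff of 0.25 corresponds to the "at least about 25%" figure. Note that a
2-fold increase corresponds to a fractional change of 1.0, i.e., 100%.

```python
# Minimal sketch: classify a gene product as over- or underexpressed.
# Function name, labels, and example values are hypothetical.

def classify_expression(test_level, control_level, min_change=0.25):
    """Return (label, fractional_change) for a test level versus a control.

    fractional_change = (test - control) / control, so +1.0 is a 2-fold
    increase and -0.5 is a 50% decrease.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    change = (test_level - control_level) / control_level
    if change >= min_change:
        return "overexpressed", change
    if change <= -min_change:
        return "underexpressed", change
    return "unchanged", change


up = classify_expression(test_level=200.0, control_level=100.0)
down = classify_expression(test_level=50.0, control_level=100.0)
```

Stricter cutoffs from the ranges above (for example 0.5 or 0.9) can be passed as
`min_change` without changing the rest of the comparison.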

"Differentially expressed polynucleotide" as used herein means a nucleic
acid molecule (RNA or DNA) comprising a sequence that represents a differentially
expressed gene, e.g., the differentially expressed polynucleotide comprises a sequence
(e.g., an open reading frame encoding a gene product) that uniquely identifies a
differentially expressed gene so that detection of the differentially expressed
polynucleotide in a sample is correlated with the presence of a differentially expressed
gene in a sample. "Differentially expressed polynucleotides" is also meant to
encompass fragments of the disclosed polynucleotides, e.g., fragments retaining
biological activity, as well as nucleic acids homologous, substantially similar, or
substantially identical (e.g., having about 90% sequence identity) to the disclosed
polynucleotides.



"Diagnosis" as used herein generally includes determination of a subject's
susceptibility to a disease or disorder, determination as to whether a subject is
presently affected by a disease or disorder, as well as the prognosis of a subject
affected by a disease or disorder (e.g., identification of pre-metastatic or metastatic
cancerous states, stages of cancer, or responsiveness of cancer to therapy). The present
invention particularly encompasses diagnosis of subjects in the context of breast cancer
(e.g., carcinoma in situ (e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive
breast cancer, ER-negative breast cancer, or other forms and/or stages of breast cancer),
lung cancer (e.g., small cell carcinoma, non-small cell carcinoma, mesothelioma, and
other forms and/or stages of lung cancer), and colon cancer (e.g., adenomatous polyp,
colorectal carcinoma, and other forms and/or stages of colon cancer).



"Sample" or "biological sample" as used throughout here are generally meant to refer to
samples of biological fluids or tissues, particularly samples obtained from tissues,
especially from cells of the type associated with the disease for which the diagnostic
application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is also
meant to encompass derivatives and fractions of such samples (e.g., cell lysates). Where
the sample is solid tissue, the cells of the tissue can be dissociated or tissue
sections can be analyzed.

Methods of the subject invention useful in diagnosis or prognosis
typically involve comparison of the abundance of a selected differentially expressed
gene product in a sample of interest with that of a control to determine any relative
differences in the expression of the gene product, where the difference can be measured
qualitatively and/or quantitatively. Quantitation can be accomplished, for example, by
comparing the level of expression product detected in the sample with the amounts of
product present in a standard curve. A comparison can be made visually; by using a
technique such as densitometry, with or without computerized assistance; by preparing
a representative library of cDNA clones of mRNA isolated from a test sample,
sequencing the clones in the library to determine the number of cDNA clones
corresponding to the same gene product, and analyzing the number of clones
corresponding to that same gene product relative to the number of clones of the same
gene product in a control sample; or by using an array to detect relative levels of
hybridization to a selected sequence or set of sequences, and comparing the
hybridization pattern to that of a control. The differences in expression are then
correlated with the presence or absence of an abnormal expression pattern. A variety of
different methods for determining the nucleic acid abundance in a sample are known to
those of skill in the art (see, e.g., WO 97/27317). In general, diagnostic assays of the
invention involve detection of a gene product of the polynucleotide sequence (e.g.,
mRNA or polypeptide) that corresponds to a sequence of SEQ ID NOs: 1-3351. The
patient from whom the sample is obtained can be apparently healthy, susceptible to
disease (e.g., as determined by family history or exposure to certain environmental
factors), or can already be identified as having a condition in which altered expression
of a gene product of the invention is implicated.

Diagnosis can be determined based on detected gene product expression
levels of a gene product encoded by at least one, preferably at least two or more, at least
3 or more, or at least 4 or more of the polynucleotides having a sequence set forth in
SEQ ID NOs: 1-3351, and can involve detection of expression of genes corresponding to
all of SEQ ID NOs: 1-3351 and/or additional sequences that can serve as additional
diagnostic markers and/or reference sequences. Where the diagnostic method is
designed to detect the presence or susceptibility of a patient to cancer, the assay
preferably involves detection of a gene product encoded by a gene corresponding to a
polynucleotide that is differentially expressed in cancer. Examples of such differentially
expressed polynucleotides are described in the Examples below. Given the provided
polynucleotides and information regarding their relative expression levels provided
herein, assays using such polynucleotides and detection of their expression levels in
diagnosis and prognosis will be readily apparent to the ordinarily skilled artisan.



Any of a variety of detectable labels can be used in connection with the 
various embodiments of the diagnostic methods of the invention. Suitable detectable 
labels include fluorochromes (e.g., fluorescein isothiocyanate (FITC), rhodamine, 
Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 
2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein, 6-carboxy-X-rhodamine (ROX), 
6-carboxy-2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or 
N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels (e.g., 32P, 
35S, 3H, etc.), and the like. The detectable label can involve a two-stage system (e.g., 
biotin-avidin, hapten-anti-hapten antibody, etc.). 

Reagents specific for the polynucleotides and polypeptides of the 
invention, such as antibodies and nucleotide probes, can be supplied in a kit for 
detecting the presence of an expression product in a biological sample. The kit can also 
contain buffers or labeling components, as well as instructions for using the reagents to 
detect and quantify expression products in the biological sample. Exemplary 
embodiments of the diagnostic methods of the invention are described below in more 
detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample 
is assayed for the level of a differentially expressed polypeptide. Diagnosis can be 
accomplished using any of a number of methods to determine the absence or presence 
or altered amounts of the differentially expressed polypeptide in the test sample. For 
example, detection can utilize staining of cells or histological sections with labeled 
antibodies, performed in accordance with conventional methods. Cells can be 
permeabilized to stain cytoplasmic molecules. In general, antibodies that specifically 
bind a differentially expressed polypeptide of the invention are added to a sample, and 
incubated for a period of time sufficient to allow binding to the epitope, usually at least 
about 10 minutes. The antibody can be detectably labeled for direct detection (e.g., 
using radioisotopes, enzymes, fluorescers, chemiluminescers, and the like), or can be 
used in conjunction with a second stage antibody or reagent to detect binding (e.g., 
biotin with horseradish peroxidase-conjugated avidin, a secondary antibody conjugated 
to a fluorescent compound, e.g., fluorescein, rhodamine, Texas red, etc.). The absence 
or presence of antibody binding can be determined by various methods, including flow 
cytometry of dissociated cells, microscopy, radiography, scintillation counting, etc. 
Any suitable alternative method of qualitative or quantitative detection of levels or 
amounts of differentially expressed polypeptide can be used, for example ELISA, 
western blot, immunoprecipitation, radioimmunoassay, etc. 

mRNA detection. The diagnostic methods of the invention can also or 
alternatively involve detection of mRNA encoded by a gene corresponding to a 
differentially expressed polynucleotide of the invention. Any suitable qualitative or 
quantitative methods known in the art for detecting specific mRNAs can be used. 
mRNA can be detected by, for example, in situ hybridization in tissue sections, by 
reverse transcriptase-PCR, or in Northern blots containing poly A+ mRNA. One of 
skill in the art can readily use these methods to determine differences in the size or 
amount of mRNA transcripts between two samples. mRNA expression levels in a 
sample can also be determined by generation of a library of expressed sequence tags 
(ESTs) from the sample, where the EST library is representative of sequences present in 
the sample (Adams, et al., (1991) Science 252:1651). Enumeration of the relative 
representation of ESTs within the library can be used to approximate the relative 
representation of the gene transcript within the starting sample. The results of EST 
analysis of a test sample can then be compared to EST analysis of a reference sample to 
determine the relative expression levels of a selected polynucleotide, particularly a 
polynucleotide corresponding to one or more of the differentially expressed genes 
described herein. Alternatively, gene expression in a test sample can be analyzed 
using serial analysis of gene expression (SAGE) methodology (e.g., Velculescu et al., 
Science (1995) 270:484) or differential display (DD) methodology (see, e.g., U.S. 
Patent Nos. 5,776,683 and 5,807,680). 
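
As a rough illustration of the EST enumeration approach described above, the following sketch counts how often each gene is represented in a test library and a reference library and reports the ratio of the relative frequencies. The function names, the gene labels, the library contents, and the pseudocount of 0.5 are assumptions made for illustration only; they are not part of the disclosed methods.

```python
from collections import Counter


def relative_representation(est_gene_ids):
    """Return {gene: fraction of ESTs in the library assigned to that gene}."""
    counts = Counter(est_gene_ids)
    total = sum(counts.values())
    return {gene: n / total for gene, n in counts.items()}


def expression_ratio(test_ests, reference_ests, gene, pseudocount=0.5):
    """Ratio of a gene's relative representation in test versus reference.

    The pseudocount (an assumption, not taken from the text) avoids division
    by zero when a gene has no ESTs in one of the libraries.
    """
    test_counts = Counter(test_ests)
    ref_counts = Counter(reference_ests)
    test_freq = (test_counts[gene] + pseudocount) / (len(test_ests) + pseudocount)
    ref_freq = (ref_counts[gene] + pseudocount) / (len(reference_ests) + pseudocount)
    return test_freq / ref_freq


# Hypothetical libraries: each entry names the gene an EST was assigned to.
test_library = ["geneA"] * 8 + ["geneB"] * 2
reference_library = ["geneA"] * 2 + ["geneB"] * 8

fractions = relative_representation(test_library)
ratio_a = expression_ratio(test_library, reference_library, "geneA")
```

A ratio above 1 suggests that the transcript is more abundant in the test sample than in the reference sample; with small libraries such ratios are only approximate.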

Alternatively, gene expression can be analyzed using hybridization 
analysis. Oligonucleotides or cDNA can be used to selectively identify or capture DNA 
or RNA of specific sequence composition, and the amount of RNA or cDNA hybridized 
to a known capture sequence determined qualitatively or quantitatively, to provide 
information about the relative representation of a particular message within the pool of 
cellular messages in a sample. Hybridization analysis can be designed to allow for 
concurrent screening of the relative expression of hundreds to thousands of genes by 
using, for example, array-based technologies having high density formats, including 
filters, microscope slides, or microchips, or solution-based technologies that use 
spectroscopic analysis (e.g., mass spectrometry). One exemplary use of arrays in the 
diagnostic methods of the invention is described below in more detail. 



Use of a single gene in diagnostic applications. The diagnostic methods 
of the invention can focus on the expression of a single differentially expressed gene. 
For example, the diagnostic method can involve detecting a differentially expressed 
gene, or a polymorphism of such a gene (e.g., a polymorphism in a coding region or 
control region), that is associated with disease. Disease-associated polymorphisms can 
include deletion or truncation of the gene, mutations that alter expression level and/or 
affect activity of the encoded protein, etc. 

A number of methods are available for analyzing nucleic acids for the 
presence of a specific sequence, e.g., a disease-associated polymorphism. Where large 
amounts of DNA are available, genomic DNA is used directly. Alternatively, the 
region of interest is cloned into a suitable vector and grown in sufficient quantity for 
analysis. Cells that express a differentially expressed gene can be used as a source of 
mRNA, which can be assayed directly or reverse transcribed into cDNA for analysis. 
The nucleic acid can be amplified by conventional techniques, such as the polymerase 
chain reaction (PCR), to provide sufficient amounts for analysis, and a detectable label 
can be included in the amplification reaction (e.g., using a detectably labeled primer or 
detectably labeled oligonucleotides) to facilitate detection. Alternatively, various 
methods are also known in the art that utilize oligonucleotide ligation as a means of 
detecting polymorphisms; see, e.g., Riley et al., Nucl. Acids Res. (1990) 18:2887; and 
Delahunty et al., Am. J. Hum. Genet. (1996) 58:1239. 

The amplified or cloned sample nucleic acid can be analyzed by one of a 
number of methods known in the art. The nucleic acid can be sequenced by dideoxy or 
other methods, and the sequence of bases compared to a selected sequence, e.g., to a 
wild-type sequence. Hybridization with the polymorphic or variant sequence can also 
be used to determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The 
hybridization pattern of a polymorphic or variant sequence and a control sequence to an 
array of oligonucleotide probes immobilized on a solid support, as described in U.S. 
Patent No. 5,445,934, or in WO 95/35505, can also be used as a means of identifying 
polymorphic or variant sequences associated with disease. Single strand 
conformational polymorphism (SSCP) analysis, denaturing gradient gel electrophoresis 
(DGGE), and heteroduplex analysis in gel matrices are used to detect conformational 
changes created by DNA sequence variation as alterations in electrophoretic mobility. 
Alternatively, where a polymorphism creates or destroys a recognition site for a 
restriction endonuclease, the sample is digested with that endonuclease, and the 
products size fractionated to determine whether the fragment was digested. 
Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 
agarose gels. 
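
The restriction-site approach just described can be illustrated with a short sketch that checks whether a variant creates or destroys a recognition site and lists the fragment sizes expected after digestion. The sequences are hypothetical, and the EcoRI site (GAATTC, cut after the first G) is used only as a familiar example. Only one strand is scanned, which is sufficient for a palindromic site.

```python
def fragment_sizes(sequence, site, cut_offset):
    """Sizes of the fragments produced by cutting a linear sequence at every site.

    cut_offset is the number of bases from the start of the site to the cut.
    """
    sequence = sequence.upper()
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    boundaries = [0] + cuts + [len(sequence)]
    return [b - a for a, b in zip(boundaries, boundaries[1:])]


def site_status(wild_type, variant, site):
    """Report whether the variant creates, destroys, or leaves the site unchanged."""
    in_wild_type = site in wild_type.upper()
    in_variant = site in variant.upper()
    if in_wild_type and not in_variant:
        return "destroyed"
    if in_variant and not in_wild_type:
        return "created"
    return "unchanged"


ECORI_SITE, ECORI_CUT = "GAATTC", 1

wild_type = "ATGCCGAATTCGGTACCTTAG"  # hypothetical amplicon containing one site
variant = "ATGCCGAGTTCGGTACCTTAG"    # single-base change within the site

status = site_status(wild_type, variant, ECORI_SITE)
wild_type_fragments = fragment_sizes(wild_type, ECORI_SITE, ECORI_CUT)
variant_fragments = fragment_sizes(variant, ECORI_SITE, ECORI_CUT)
```

In this toy case the wild-type amplicon yields two fragments while the variant remains uncut, which is the difference that size fractionation would reveal.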

Screening for mutations in a gene can be based on the functional or 
antigenic characteristics of the protein. Protein truncation assays are useful in detecting 
deletions that can affect the biological activity of the protein. Various immunoassays 
designed to detect polymorphisms in proteins can be used in screening. Where many 
diverse genetic mutations lead to a particular disease phenotype, functional protein 
assays have proven to be effective screening tools. The activity of the encoded protein 
can be determined by comparison with the wild-type protein. 

Pattern matching in diagnosis using arrays. In another embodiment, the 
diagnostic and/or prognostic methods of the invention involve detection of expression 
of a selected set of genes in a test sample to produce a test expression pattern (TEP). 
The TEP is compared to a reference expression pattern (REP), which is generated by 
detection of expression of the selected set of genes in a reference sample (e.g., a 
positive or negative control sample). The selected set of genes includes at least one of 
the genes of the invention, which genes correspond to the polynucleotide sequences of 
SEQ ID NOs: 1-3351. Of particular interest is a selected set of genes that includes genes 
differentially expressed in the disease for which the test sample is to be screened. 

"Reference sequences" or "reference polynucleotides" as used herein in 
the context of differential gene expression analysis and diagnosis/prognosis refers to a 
selected set of polynucleotides, which selected set includes one or more of the 
differentially expressed polynucleotides described herein. A plurality of reference 
sequences, preferably comprising positive and negative control sequences, can be 
included as reference sequences. Additional suitable reference sequences are found in 
Genbank, Unigene, and other nucleotide sequence databases (including, e.g., expressed 
sequence tag (EST), partial, and full-length sequences). 

"Reference array" means an array having reference sequences for use in 
hybridization with a sample, where the reference sequences include all, at least one of, 
or any subset of the differentially expressed polynucleotides described herein. Usually 
such an array will include at least 3 different reference sequences, and can include any 
one or all of the provided differentially expressed sequences. Arrays of interest can 
further comprise sequences, including polymorphisms, of other genetic sequences, 
particularly other sequences of interest for screening for a disease or disorder (e.g., 
cancer, dysplasia, or other related or unrelated diseases, disorders, or conditions). The 
oligonucleotide sequence on the array will usually be at least about 12 nt in length, and 
can be of about the length of the provided sequences, or can extend into the flanking 
regions to generate fragments of 100 nt to 200 nt in length or more. Reference arrays 
can be produced according to any suitable methods known in the art. For example, 
methods of producing large arrays of oligonucleotides are described in U.S. Patent Nos. 
5,134,854 and 5,445,934 using light-directed synthesis techniques. Using a computer 
controlled system, a heterogeneous array of monomers is converted, through 
simultaneous coupling at a number of reaction sites, into a heterogeneous array of 
polymers. Alternatively, microarrays are generated by deposition of pre-synthesized 
oligonucleotides onto a solid substrate, for example as described in PCT published 
application no. WO 95/35505. 

A "reference expression pattern" or "REP" as used herein refers to the 
relative levels of expression of a selected set of genes, particularly of differentially 
expressed genes, that is associated with a selected cell type, e.g., a normal cell, a 
cancerous cell, a cell exposed to an environmental stimulus, and the like. A "test 
expression pattern" or "TEP" refers to relative levels of expression of a selected set of 
genes, particularly of differentially expressed genes, in a test sample (e.g., a cell of 
unknown or suspected disease state, from which mRNA is isolated). 

REPs can be generated in a variety of ways according to methods well 
known in the art. For example, REPs can be generated by hybridizing a control sample 
to an array having a selected set of polynucleotides (particularly a selected set of 
differentially expressed polynucleotides), acquiring the hybridization data from the 
array, and storing the data in a format that allows for ready comparison of the REP with 
a TEP. Alternatively, all expressed sequences in a control sample can be isolated and 
sequenced, e.g., by isolating mRNA from a control sample, converting the mRNA into 
cDNA, and sequencing the cDNA. The resulting sequence information roughly or 
precisely reflects the identity and relative number of expressed sequences in the sample. 
The sequence information can then be stored in a format (e.g., a computer-readable 
format) that allows for ready comparison of the REP with a TEP. The REP can be 
normalized prior to or after data storage, and/or can be processed to selectively remove 
sequences of expressed genes that are of less interest or that might complicate analysis 
(e.g., some or all of the sequences associated with housekeeping genes can be 
eliminated from REP data). 

TEPs can be generated in a manner similar to REPs, e.g., by hybridizing 
a test sample to an array having a selected set of polynucleotides, particularly a selected 
set of differentially expressed polynucleotides, acquiring the hybridization data from the 
array, and storing the data in a format that allows for ready comparison of the TEP with 
a REP. The REP and TEP to be used in a comparison can be generated simultaneously, 
or the TEP can be compared to previously generated and stored REPs. 

In one embodiment of the invention, comparison of a TEP with a REP 
involves hybridizing a test sample with a reference array, where the reference array has 
one or more reference sequences for use in hybridization with a sample. The reference 
sequences include all, at least one of, or any subset of the differentially expressed 
polynucleotides described herein. Hybridization data for the test sample is acquired, the 
data normalized, and the produced TEP compared with a REP generated using an array 
having the same or similar selected set of differentially expressed polynucleotides. 
Probes that correspond to sequences differentially expressed between the two samples 
will show decreased or increased hybridization efficiency for one of the samples 
relative to the other. 
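
A minimal sketch of this comparison follows. It scales each pattern by its total signal and flags probes whose normalized signal differs between the test and reference patterns. The probe names, the signal values, the total-signal normalization, and the 30% cutoff are assumptions made for illustration; the cutoff was picked from within the range of about 25% to about 40% that this description gives for a substantial match.

```python
def normalize(pattern):
    """Scale signals so they sum to 1, making two patterns comparable."""
    total = sum(pattern.values())
    return {probe: signal / total for probe, signal in pattern.items()}


def compare_patterns(tep, rep, max_relative_difference=0.30):
    """Classify each shared probe as increased, decreased, or matched.

    The relative difference is taken against the reference signal, which is
    assumed to be nonzero for every shared probe.
    """
    tep_n, rep_n = normalize(tep), normalize(rep)
    calls = {}
    for probe in sorted(tep_n.keys() & rep_n.keys()):
        difference = (tep_n[probe] - rep_n[probe]) / rep_n[probe]
        if difference > max_relative_difference:
            calls[probe] = "increased"
        elif difference < -max_relative_difference:
            calls[probe] = "decreased"
        else:
            calls[probe] = "matched"
    return calls


def is_substantial_match(calls):
    """True when no probe differs by more than the chosen cutoff."""
    return all(call == "matched" for call in calls.values())


# Hypothetical raw hybridization signals (arbitrary units).
rep = {"probe1": 100.0, "probe2": 200.0, "probe3": 700.0}
tep = {"probe1": 300.0, "probe2": 200.0, "probe3": 400.0}

calls = compare_patterns(tep, rep)
```

Other normalization schemes (for example, scaling to control sequences) would fit the same structure; total-signal scaling is used here only because it needs no extra inputs.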

Methods for collection of data from hybridization of samples with 
reference arrays are well known in the art. For example, the polynucleotides of the 
reference and test samples can be generated using a detectable fluorescent label, and 
hybridization of the polynucleotides in the samples detected by scanning the 
microarrays for the presence of the detectable label using, for example, a microscope 
and light source for directing light at a substrate. A photon counter detects fluorescence 
from the substrate, while an x-y translation stage varies the location of the substrate. A 
confocal detection device that can be used in the subject methods is described in U.S. 
Patent No. 5,631,734. A scanning laser microscope is described in Shalon et al., 
Genome Res. (1996) 6:639. A scan, using the appropriate excitation line, is performed 
for each fluorophore used. The digital images generated from the scan are then 
combined for subsequent analysis. For any particular array element, the fluorescent 
signal from one sample (e.g., a test sample) is compared to the fluorescent 
signal from another sample (e.g., a reference sample) as a ratio, and the relative signal 
intensity determined. 



Methods for analyzing the data collected from hybridization to arrays are 
well known in the art. For example, where detection of hybridization involves a 
fluorescent label, data analysis can include the steps of determining fluorescent intensity 
as a function of substrate position from the data collected, removing outliers, i.e., data 
deviating from a predetermined statistical distribution, and calculating the relative 
binding affinity of the targets from the remaining data. The resulting data can be 
displayed as an image with the intensity in each region varying according to the binding 
affinity between targets and probes. 
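
The analysis steps listed above can be sketched as follows for replicate spots of a single probe. The text does not say which statistical distribution defines an outlier, so the median-absolute-deviation rule and the cutoff of 3 used here are assumptions, as are the intensity values and function names.

```python
from statistics import mean, median


def remove_outliers(intensities, cutoff=3.0):
    """Drop values more than `cutoff` scaled MADs away from the median."""
    center = median(intensities)
    mad = median(abs(x - center) for x in intensities)
    if mad == 0:
        return list(intensities)
    scale = 1.4826 * mad  # makes the MAD comparable to a standard deviation
    return [x for x in intensities if abs(x - center) / scale <= cutoff]


def relative_signal(test_spots, reference_spots):
    """Ratio of mean test to mean reference intensity after outlier removal."""
    return mean(remove_outliers(test_spots)) / mean(remove_outliers(reference_spots))


# Hypothetical replicate intensities for one probe; 950.0 mimics a dust artifact.
test_spots = [210.0, 190.0, 200.0, 205.0, 950.0]
reference_spots = [100.0, 98.0, 102.0, 100.0]

kept = remove_outliers(test_spots)
ratio = relative_signal(test_spots, reference_spots)
```

Without the outlier step the single artifact would inflate the test mean considerably, which is why filtering precedes the ratio calculation.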



In general, the test sample is classified as having a gene expression 
profile corresponding to that associated with a disease or non-disease state by 
comparing the TEP generated from the test sample to one or more REPs generated from 
reference samples (e.g., from samples associated with cancer or specific stages of 
cancer, dysplasia, samples affected by a disease other than cancer, normal samples, 
etc.). The criteria for a match or a substantial match between a TEP and a REP include 
expression of the same or substantially the same set of reference genes, as well as 
expression of these reference genes at substantially the same levels (e.g., no significant 
difference between the samples for a signal associated with a selected reference 
sequence after normalization of the samples, or at least no greater than about 25% to 
about 40% difference in signal strength for a given reference sequence). In general, a 
pattern match between a TEP and a REP includes a match in expression, preferably a 
match in qualitative or quantitative expression level, of at least one of, all or any subset 
of the differentially expressed genes of the invention. 

Pattern matching can be performed manually, or can be performed using 
a computer program. Methods for preparation of substrate matrices (e.g., arrays), 
design of oligonucleotides for use with such matrices, labeling of probes, hybridization 
conditions, scanning of hybridized matrices, and analysis of patterns generated, 
including comparison analysis, are described in, for example, U.S. Patent No. 
5,800,992. 

Diagnosis, Prognosis and Management of Cancer 

The polynucleotides of the invention and their gene products are of 
particular interest as genetic or biochemical markers (e.g., in blood or tissues) that will 
detect the earliest changes along the carcinogenesis pathway and/or monitor the 
efficacy of various therapies and preventive interventions. For example, the level of 
expression of certain polynucleotides can be indicative of a poorer prognosis, and 
therefore warrant more aggressive chemo- or radio-therapy for a patient or vice versa. 
The correlation of novel surrogate tumor specific features with response to treatment 
and outcome in patients can define prognostic indicators that allow the design of 
tailored therapy based on the molecular profile of the tumor. These therapies include 
antibody targeting and gene therapy. Determining expression of certain polynucleotides 
and comparison of a patient's profile with known expression in normal tissue and 
variants of the disease allows a determination of the best possible treatment for a 
patient, both in terms of specificity of treatment and in terms of comfort level of the 
patient. Surrogate tumor markers, such as polynucleotide expression, can also be used 
to better classify, and thus diagnose and treat, different forms and disease states of 
cancer. Two classifications widely used in oncology that can benefit from identification 
of the expression levels of the polynucleotides of the invention are staging of the 
cancerous disorder, and grading the nature of the cancerous tissue. 

The polynucleotides of the invention can be useful to monitor patients 
having or susceptible to cancer to detect potentially malignant events at a molecular 
level before they are detectable at a gross morphological level. Furthermore, a 
polynucleotide of the invention identified as important for one type of cancer can also 
have implications for development or risk of development of other types of cancer, e.g., 
where a polynucleotide is differentially expressed across various cancer types. Thus, 
for example, expression of a polynucleotide that has clinical implications for metastatic 
colon cancer can also have clinical implications for stomach cancer or endometrial 
cancer. 

Staging. Staging is a process used by physicians to describe how 
advanced the cancerous state is in a patient. Generally, if a cancer is only detectable in 
the area of the primary lesion without having spread to any lymph nodes it is called 
Stage I. If it has spread only to the closest lymph nodes, it is called Stage II. In Stage 
III, the cancer has generally spread to the lymph nodes in near proximity to the site of 
the primary lesion. Cancers that have spread to a distant part of the body, such as the 
liver, bone, brain or other site, are Stage IV, the most advanced stage. 

The polynucleotides of the invention can facilitate fine-tuning of the 
staging process by identifying markers for the aggressiveness of a cancer, e.g., the 
metastatic potential, as well as the presence in different areas of the body. Thus, 
detection in a borderline Stage II cancer of a polynucleotide signifying a high metastatic 
potential can be used to change the tumor from Stage II to Stage III, justifying more 
aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a 
tumor resembles normal tissue of its same type. The microscopic appearance of a tumor 
is used to identify tumor grade based on parameters such as cell morphology, cellular 
organization, and other markers of differentiation. As a general rule, the grade of a 
tumor corresponds to its rate of growth or aggressiveness, with undifferentiated or 
high-grade tumors being more aggressive than well differentiated or low-grade tumors. The 
following guidelines are generally used for grading tumors: 1) GX: grade cannot be 
assessed; 2) G1: well differentiated; 3) G2: moderately well differentiated; 4) G3: poorly 
differentiated; 5) G4: undifferentiated. The polynucleotides of the invention can be 
especially valuable in determining the grade of the tumor, as they not only can aid in 
determining the differentiation status of the cells of a tumor, they can also identify 
factors other than differentiation that are valuable in determining the aggressivity of a 
tumor, such as metastatic potential. 

Detection of lung cancer. The polynucleotides of the invention can be 
used to detect lung cancer in a subject. Although there are more than a dozen different 
kinds of lung cancer, the two main types of lung cancer are small cell and nonsmall cell, 
which encompass about 90% of all lung cancer cases. Small cell carcinoma (also called 
oat cell carcinoma) usually starts in one of the larger bronchial tubes, grows fairly 
rapidly, and is likely to be large by the time of diagnosis. Nonsmall cell lung cancer 
(NSCLC) is made up of three general subtypes of lung cancer. Epidermoid carcinoma 
(also called squamous cell carcinoma) usually starts in one of the larger bronchial tubes 
and grows relatively slowly. The size of these tumors can range from very small to 
quite large. Adenocarcinoma starts growing near the outside surface of the lung and can 
vary in both size and growth rate. Some slowly growing adenocarcinomas are described 
as alveolar cell cancer. Large cell carcinoma starts near the surface of the lung, grows 
rapidly, and the growth is usually fairly large when diagnosed. Other less common 
forms of lung cancer are carcinoid, cylindroma, mucoepidermoid, and malignant 
mesothelioma. 

The polynucleotides of the invention, e.g., polynucleotides differentially 
expressed in normal cells versus cancerous lung cells (e.g., tumor cells of high or low 
metastatic potential) or between types of cancerous lung cells (e.g., high metastatic 
versus low metastatic), can be used to distinguish types of lung cancer as well as 
to identify traits specific to a certain patient's cancer and to select an appropriate 
therapy. For example, if the patient's biopsy expresses a polynucleotide that is 
associated with a low metastatic potential, it may justify leaving a larger portion of the 
patient's lung in surgery to remove the lesion. Alternatively, a smaller lesion with 
expression of a polynucleotide that is associated with high metastatic potential may 
justify a more radical removal of lung tissue and/or the surrounding lymph nodes, even 
if no metastasis can be identified through pathological examination. 

Detection of breast cancer. The majority of breast cancers are 
adenocarcinoma subtypes, which can be summarized as follows: 1) ductal carcinoma 
in situ (DCIS), including comedocarcinoma; 2) infiltrating (or invasive) ductal 
carcinoma (IDC); 3) lobular carcinoma in situ (LCIS); 4) infiltrating (or invasive) 
lobular carcinoma (ILC); 5) inflammatory breast cancer; 6) medullary carcinoma; 
7) mucinous carcinoma; 8) Paget's disease of the nipple; 9) Phyllodes tumor; and 
10) tubular carcinoma. 

The expression of polynucleotides of the invention can be used in the 
diagnosis and management of breast cancer, as well as to distinguish between types of 
breast cancer. Breast cancer can be detected using expression levels of 
any of the appropriate polynucleotides of the invention, either alone or in combination. 
The aggressive nature and/or the metastatic potential of a breast cancer 
can also be determined by comparing levels of one or more polynucleotides of the 
invention with levels of another sequence known to vary in cancerous tissue, 
e.g., ER expression. In addition, development of breast cancer can be detected by 
examining the ratio of expression of a differentially expressed polynucleotide to the 
levels of steroid hormones (e.g., testosterone or estrogen) or to other hormones (e.g., 
growth hormone, insulin). Thus expression of specific marker polynucleotides can be 
used to discriminate between normal and cancerous breast tissue, to discriminate 
between breast cancers with different cells of origin, to discriminate between breast 
cancers with different potential metastatic rates, etc. 

Detection of colon cancer. The polynucleotides of the invention 
exhibiting the appropriate expression pattern can be used to detect colon cancer in a 
subject. Colorectal cancer is one of the most common neoplasms in humans and 
perhaps the most frequent form of hereditary neoplasia. Prevention and early detection 
are key factors in controlling and curing colorectal cancer. Colorectal cancer begins as 
polyps, which are small, benign growths of cells that form on the inner lining of the 
colon. Over a period of several years, some of these polyps accumulate additional 
mutations and become cancerous. Multiple familial colorectal cancer disorders have 
been identified, which are summarized as follows: 1) Familial adenomatous polyposis 
(FAP); 2) Gardner's syndrome; 3) Hereditary nonpolyposis colon cancer (HNPCC); and 
4) Familial colorectal cancer in Ashkenazi Jews. The expression of appropriate 
polynucleotides of the invention can be used in the diagnosis, prognosis and 
management of colorectal cancer. Colon cancer can be detected using 
expression levels of any of these sequences, alone or in combination. 
The aggressive nature and/or the metastatic potential of a 
colon cancer can be determined by comparing levels of one or more polynucleotides of 
the invention with total levels of another sequence known to vary in 
cancerous tissue, e.g., expression of p53, DCC, ras, or FAP (see, e.g., Fearon ER, et al., 
Cell (1990) 61(5):759; Hamilton SR et al., Cancer (1993) 72:957; Bodmer W, et al., 
Nat Genet. (1994) 4(3):217; Fearon ER, Ann N Y Acad Sci. (1995) 768:101). For 
example, development of colon cancer can be detected by examining the ratio of any of 
the polynucleotides of the invention to the levels of oncogenes (e.g., ras) or tumor 
suppressor genes (e.g., FAP or p53). Thus expression of specific marker 
polynucleotides can be used to discriminate between normal and cancerous colon tissue, 
to discriminate between colon cancers with different cells of origin, to discriminate 
between colon cancers with different potential metastatic rates, etc. 

Use of Polynucleotides to Screen for Peptide Analogs and Antagonists 

Polypeptides encoded by the instant polynucleotides and corresponding 
full length genes can be used to screen peptide libraries to identify binding partners, 
such as receptors, from among the encoded polypeptides. Peptide libraries can be 
synthesized according to methods known in the art (see, e.g., U.S. Patent No. 5,010,175, 
and WO 91/17823). Agonists or antagonists of the polypeptides of the invention can be 
screened using any available method known in the art, such as signal transduction, 
antibody binding, receptor binding, mitogenic assays, chemotaxis assays, etc. The 
assay conditions ideally should resemble the conditions under which the native activity 
is exhibited in vivo, that is, under physiologic pH, temperature, and ionic strength. 
Suitable agonists or antagonists will exhibit strong inhibition or enhancement of the 
native activity at concentrations that do not cause toxic side effects in the subject. 
Agonists or antagonists that compete for binding to the native polypeptide can require 
concentrations equal to or greater than the native concentration, while inhibitors capable 
of binding irreversibly to the polypeptide can be added in concentrations on the order of 
the native concentration. 

Such screening and experimentation can lead to identification of a novel 
polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA 
corresponding to a polynucleotide of the invention, and at least one peptide agonist or 
antagonist of the novel binding partner. Such agonists and antagonists can be used to 
modulate, enhance, or inhibit receptor function in cells to which the receptor is native, 
or in cells that possess the receptor as a result of genetic engineering. Further, if the 
novel receptor shares biologically important characteristics with a known receptor, 
information about agonist/antagonist binding can facilitate development of improved 
agonists/antagonists of the known receptor. 

Pharmaceutical Compositions and Therapeutic Uses

Pharmaceutical compositions of the invention can comprise
polypeptides, antibodies, or polynucleotides (including antisense nucleotides and
ribozymes) of the claimed invention in a therapeutically effective amount. The term
"therapeutically effective amount" as used herein refers to an amount of a therapeutic
agent sufficient to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a
detectable therapeutic or preventative effect. The effect can be detected by, for
example, chemical markers or antigen levels. Therapeutic effects also include reduction
in physical symptoms, such as decreased body temperature. The precise effective
amount for a subject will depend upon the subject's size and health, the nature and
extent of the condition, and the therapeutics or combination of therapeutics selected for
administration. Thus, it is not useful to specify an exact effective amount in advance.
However, the effective amount for a given situation is determined by routine
experimentation and is within the judgment of the clinician. For purposes of the present
invention, an effective dose will generally be from about 0.01 mg/kg to about 50 mg/kg, or
about 0.05 mg/kg to about 10 mg/kg, of the DNA constructs in the individual to which they are
administered.

A pharmaceutical composition can also contain a pharmaceutically
acceptable carrier. The term "pharmaceutically acceptable carrier" refers to a carrier for
administration of a therapeutic agent, such as antibodies or a polypeptide, genes, and
other therapeutic agents. The term refers to any pharmaceutical carrier that does not
itself induce the production of antibodies harmful to the individual receiving the
composition, and which can be administered without undue toxicity. Suitable carriers
can be large, slowly metabolized macromolecules such as proteins, polysaccharides,
polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers,
and inactive virus particles. Such carriers are well known to those of ordinary skill in
the art. Pharmaceutically acceptable carriers in therapeutic compositions can include
liquids such as water, saline, glycerol and ethanol. Auxiliary substances, such as
wetting or emulsifying agents, pH buffering substances, and the like, can also be present
in such vehicles. Typically, the therapeutic compositions are prepared as injectables,
either as liquid solutions or suspensions; solid forms suitable for solution in, or
suspension in, liquid vehicles prior to injection can also be prepared. Liposomes are
included within the definition of a pharmaceutically acceptable carrier.
Pharmaceutically acceptable salts can also be present in the pharmaceutical
composition, e.g., mineral acid salts such as hydrochlorides, hydrobromides,
phosphates, sulfates, and the like; and the salts of organic acids such as acetates,
propionates, malonates, benzoates, and the like. A thorough discussion of
pharmaceutically acceptable excipients is available in Remington's Pharmaceutical
Sciences (Mack Pub. Co., New Jersey, 1991).

Delivery Methods. Once formulated, the compositions of the invention
can be (1) administered directly to the subject (e.g., as polynucleotide or polypeptides);
or (2) delivered ex vivo, to cells derived from the subject (e.g., as in ex vivo gene
therapy). Direct delivery of the compositions will generally be accomplished by
parenteral injection, e.g., subcutaneously, intraperitoneally, intravenously or
intramuscularly, intratumorally or to the interstitial space of a tissue. Other modes of
administration include oral and pulmonary administration, suppositories, and
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment can
be a single dose schedule or a multiple dose schedule.



Methods for the ex vivo delivery and reimplantation of transformed cells
into a subject are known in the art and described in, e.g., International Publication No.
WO 93/14778. Examples of cells useful in ex vivo applications include, for example,
stem cells, particularly hematopoietic, lymph cells, macrophages, dendritic cells, or
tumor cells. Generally, delivery of nucleic acids for both ex vivo and in vitro
applications can be accomplished by, for example, dextran-mediated transfection,
calcium phosphate precipitation, polybrene-mediated transfection, protoplast fusion,
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct
microinjection of the DNA into nuclei, all well known in the art.

Once a gene corresponding to a polynucleotide of the invention has been
found to correlate with a proliferative disorder, such as neoplasia, dysplasia, and
hyperplasia, the disorder can be amenable to treatment by administration of a
therapeutic agent based on the provided polynucleotide, corresponding polypeptide or
other corresponding molecule (e.g., antisense, ribozyme, etc.).

The dose and the means of administration of the inventive
pharmaceutical compositions are determined based on the specific qualities of the
therapeutic composition, the condition, age, and weight of the patient, the progression
of the disease, and other relevant factors. For example, administration of
polynucleotide therapeutic compositions of the invention includes local or
systemic administration, including injection, oral administration, particle gun or
catheterized administration, and topical administration. Preferably, the therapeutic
polynucleotide composition contains an expression construct comprising a promoter
operably linked to a polynucleotide of at least 12, 22, 25, 30, or 35 contiguous nt of the
polynucleotide disclosed herein. Various methods can be used to administer the
therapeutic composition directly to a specific site in the body. For example, a small
metastatic lesion is located and the therapeutic composition injected several times in
several different locations within the body of the tumor. Alternatively, arteries which serve
a tumor are identified, and the therapeutic composition injected into such an artery, in
order to deliver the composition directly into the tumor. A tumor that has a necrotic
center is aspirated and the composition injected directly into the now empty center of
the tumor. The antisense composition is directly administered to the surface of the
tumor, for example, by topical application of the composition. X-ray imaging is used to
assist in certain of the above delivery methods.

Receptor-mediated targeted delivery of therapeutic compositions
containing an antisense polynucleotide, subgenomic polynucleotides, or antibodies to
specific tissues can also be used. Receptor-mediated DNA delivery techniques are
described in, for example, Findeis et al., Trends Biotechnol. (1993) 11:202; Chiou et al.,
Gene Therapeutics: Methods And Applications Of Direct Gene Transfer (J.A. Wolff,
ed.) (1994); Wu et al., J. Biol. Chem. (1988) 263:621; Wu et al., J. Biol. Chem. (1994)
269:542; Zenke et al., Proc. Natl. Acad. Sci. (USA) (1990) 87:3655; Wu et al., J. Biol.
Chem. (1991) 266:338. Therapeutic compositions containing a polynucleotide are
administered in a range of about 100 ng to about 200 mg of DNA for local
administration in a gene therapy protocol. Concentration ranges of about 500 ng to
about 50 mg, about 1 mg to about 2 mg, about 5 mg to about 500 mg, and about 20 mg
to about 100 mg of DNA can also be used during a gene therapy protocol. Factors such
as method of action (e.g., for enhancing or inhibiting levels of the encoded gene
product) and efficacy of transformation and expression are considerations which will
affect the dosage required for ultimate efficacy of the antisense subgenomic
polynucleotides. Where greater expression is desired over a larger area of tissue, larger
amounts of antisense subgenomic polynucleotides or the same amounts readministered
in a successive protocol of administrations, or several administrations to different
adjacent or close tissue portions of, for example, a tumor site, may be required to effect
a positive therapeutic outcome. In all cases, routine experimentation in clinical trials
will determine specific ranges for optimal therapeutic effect. For polynucleotide-related
genes encoding polypeptides or proteins with anti-inflammatory activity, suitable use,
doses, and administration are described in U.S. Patent No. 5,654,173.

The therapeutic polynucleotides and polypeptides of the present
invention can be delivered using gene delivery vehicles. The gene delivery vehicle can
be of viral or non-viral origin (see generally, Jolly, Cancer Gene Therapy (1994) 1:51;
Kimura, Human Gene Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995)
1:185; and Kaplitt, Nature Genetics (1994) 6:148). Expression of such coding
sequences can be induced using endogenous mammalian or heterologous promoters.
Expression of the coding sequence can be either constitutive or regulated.

Viral-based vectors for delivery of a desired polynucleotide and
expression in a desired cell are well known in the art. Exemplary viral-based vehicles
include, but are not limited to, recombinant retroviruses (see, e.g., WO 90/07936; WO
94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 5,219,740; WO 93/11230;
WO 93/10218; U.S. Patent No. 4,777,127; GB Patent No. 2,200,651; EP 0 345 242; and
WO 91/02805), alphavirus-based vectors (e.g., Sindbis virus vectors, Semliki forest
virus (ATCC VR-67; ATCC VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246)
and Venezuelan equine encephalitis virus (ATCC VR-923; ATCC VR-1250;
ATCC VR 1249; ATCC VR-532)), and adeno-associated virus (AAV) vectors (see, e.g.,
WO 94/12649, WO 93/03769; WO 93/19191; WO 94/28938; WO 95/11984 and WO
95/00655). Administration of DNA linked to killed adenovirus as described in Curiel,
Hum. Gene Ther. (1992) 3:147 can also be employed.

Non-viral delivery vehicles and methods can also be employed,
including, but not limited to, polycationic condensed DNA linked or unlinked to killed
adenovirus alone (see, e.g., Curiel, Hum. Gene Ther. (1992) 3:147); ligand-linked
DNA (see, e.g., Wu, J. Biol. Chem. 264:16985 (1989)); eukaryotic cell delivery vehicles
(see, e.g., U.S. Patent No. 5,814,482; WO 95/07994; WO 96/17072;
WO 95/30763; and WO 97/42338) and nucleic charge neutralization or fusion with cell
membranes. Naked DNA can also be employed. Exemplary naked DNA introduction
methods are described in WO 90/11092 and U.S. Patent No. 5,580,859. Liposomes that
can act as gene delivery vehicles are described in U.S. Patent No. 5,422,120; WO
95/13796; WO 94/23697; WO 91/14445; and EP 0524968. Additional approaches are
described in Philip, Mol. Cell Biol. 14:2411 (1994), and in Woffendin, Proc. Natl.
Acad. Sci. (1994) 91:1581.



Further non-viral delivery suitable for use includes mechanical delivery
systems such as the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA
91(24):11581 (1994). Moreover, the coding sequence and the product of expression of
such can be delivered through deposition of photopolymerized hydrogel materials or
use of ionizing radiation (see, e.g., U.S. Patent No. 5,206,152 and WO 92/11033).
Other conventional methods for gene delivery that can be used for delivery of the
coding sequence include, for example, use of hand-held gene transfer particle gun (see,
e.g., U.S. Patent No. 5,149,655); use of ionizing radiation for activating transferred gene
(see, e.g., U.S. Patent No. 5,206,152 and WO 92/11033).

The present invention will now be illustrated by reference to the
following examples which set forth particularly advantageous embodiments. However,
it should be noted that these embodiments are illustrative and are not to be construed as
restricting the invention in any way.

EXAMPLES
EXAMPLE 1

Source of Biological Materials and Overview of Novel Polynucleotides
Expressed by the Biological Materials

Cell lines and human normal and tumor tissue were used to construct
cDNA libraries from mRNA isolated from the cells and tissues. Most sequences were
about 275-300 nucleotides in length. The cell lines include the KM12L4-A cell line, a
high metastatic colon cancer cell line (Morikawa et al., Cancer Research (1988)
48:6863). The KM12L4-A cell line is derived from the KM12C cell line. The KM12C
cell line, which is poorly metastatic (low metastatic), was established in culture from a
Dukes' stage B2 surgical specimen (Morikawa et al. Cancer Res. (1988) 48:6863). The
KM12L4-A is a highly metastatic subline derived from KM12C (Yeatman et al. Nucl.
Acids. Res. (1995) 23:4007; Bao-Ling et al. Proc. Annu. Meet. Am. Assoc. Cancer. Res.
(1995) 27:3269). The KM12C and KM12C-derived cell lines (e.g., KM12L4,
KM12L4-A, etc.) are well-recognized in the art as model cell lines for the study of
colon cancer (see, e.g., Morikawa et al., supra; Radinsky et al. Clin. Cancer Res.
(1995) 1:19; Yeatman et al., (1995) supra; Yeatman et al., Clin. Exp. Metastasis (1996)
14:246). These and other cell lines and tissue are described in Table 6.

The sequences of the isolated polynucleotides were first masked to
eliminate low complexity sequences using the XBLAST masking program (Claverie
"Effective Large-Scale Sequence Similarity Searches," In: Computer Methods for
Macromolecular Sequence Analysis, Doolittle, ed., Meth. Enzymol. 266:212-227
Academic Press, NY, NY (1996); see particularly Claverie, in "Automated DNA
Sequencing and Analysis Techniques" Adams et al., eds., Chap. 36, p. 267 Academic
Press, San Diego, 1994 and Claverie et al. Comput. Chem. (1993) 17:191). Generally,
masking does not influence the final search results, except to eliminate sequences of
relatively little interest due to their low complexity, and to eliminate multiple "hits" based
on similarity to repetitive regions common to multiple sequences, e.g., Alu repeats. The
sequences remaining after masking were then used in a BLASTN vs. Genbank search;
sequences that exhibited greater than 70% overlap, 99% identity, and a p value of less
than 1 x 10^-40 were discarded. Sequences from this search also were discarded if the
inclusive parameters were met, but the sequence was ribosomal or vector-derived.
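
The BLASTN discard rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Hit` record, its field names, and the function names are hypothetical and do not come from the BLAST programs, and "99% identity" is read here as "at least 99%".

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One database hit for a query sequence (hypothetical record layout)."""
    query_id: str
    overlap_pct: float    # percent of the query covered by the alignment
    identity_pct: float   # percent identical positions within the alignment
    p_value: float


def is_already_known(hit, min_overlap=70.0, min_identity=99.0, max_p=1e-40):
    """True when a hit meets all three discard thresholds from the text."""
    return (hit.overlap_pct > min_overlap
            and hit.identity_pct >= min_identity   # assumption: "99%" means >= 99
            and hit.p_value < max_p)


def retain_queries(query_ids, hits):
    """Keep only queries with no hit that satisfies the discard rule."""
    discarded = {h.query_id for h in hits if is_already_known(h)}
    return [q for q in query_ids if q not in discarded]


hits = [
    Hit("a", 80.0, 99.5, 1e-50),   # meets every threshold, so "a" is discarded
    Hit("b", 60.0, 99.5, 1e-50),   # overlap too low
    Hit("c", 90.0, 99.9, 1e-10),   # p value not small enough
]
remaining = retain_queries(["a", "b", "c", "d"], hits)
```

A query with no hits at all (here `"d"`) is retained, which matches the "unknown" group used in the next search step.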
The resulting sequences from the previous search were classified into
three groups (1, 2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant
proteins) database search: (1) unknown (no hits in the Genbank search), (2) weak
similarity (greater than 45% identity and p value of less than 1 x 10^-5), and (3) high
similarity (greater than 60% overlap, greater than 80% identity, and p value less than 1
x 10^-5). Sequences having greater than 70% overlap, greater than 99% identity, and p
value of less than 1 x 10^-40 were discarded.
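
As a rough illustration, the three-way grouping can be written as a small function. The tuple layout and the function name are assumptions made for this sketch; the text does not say how a hit that falls below the weak-similarity thresholds is handled, so the sketch treats it as "unknown" and flags that choice in a comment.

```python
def classify(best_hit):
    """Assign a query to one of the three groups described in the text.

    best_hit is None (no hit) or a tuple (overlap_pct, identity_pct, p_value);
    this layout is a hypothetical convenience, not a BLAST output format.
    """
    if best_hit is None:
        return "unknown"
    overlap, identity, p_value = best_hit
    if p_value < 1e-5 and overlap > 60 and identity > 80:
        return "high similarity"
    if p_value < 1e-5 and identity > 45:
        return "weak similarity"
    # Assumption: a hit below the weak-similarity thresholds counts as unknown.
    return "unknown"
```

The high-similarity test runs first because every high-similarity hit would also satisfy the weaker thresholds.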

The remaining sequences were classified as unknown (no hits), weak
similarity, and high similarity (parameters as above). Two searches were performed on
these sequences. First, a BLAST vs. EST database search was performed and
sequences with greater than 99% overlap, greater than 99% similarity and a p value of
less than 1 x 10^-40 were discarded. Sequences with a p value of less than 1 x 10^-65 when
compared to a database sequence of human origin were also excluded. Second, a
BLASTN vs. Patent GeneSeq database search was performed and sequences having greater
than 99% identity, p value less than 1 x 10^-40, and greater than 99% overlap were
discarded.

The remaining sequences were subjected to screening using other rules
and redundancies in the dataset. Sequences with a p value of less than 1 x 10^-111 in
relation to a database sequence of human origin were specifically excluded. The final
result provided the 3351 sequences listed in the accompanying Sequence Listing. Each
identified polynucleotide represents sequence from at least a partial mRNA transcript.
Polynucleotides that were determined to be novel were assigned a sequence
identification number.

The novel polynucleotides were assigned sequence identification numbers
SEQ ID NOs:1-3351. The first 1847 DNA sequences corresponding to the novel
polynucleotides are provided in the Sequence Listing in Table 1. DNA sequences
corresponding to the novel polynucleotides of SEQ ID NOs:1848-3351 are provided in the
Sequence Listing in Table 2. The DNA sequences of Table 2, while numbered SEQ ID 1-1504,
correspond to SEQ ID NOs:1848-3351 in the Sequence Listing, e.g., Table 2 SEQ ID
1 is SEQ ID NO:1848, Table 2 SEQ ID 2 is SEQ ID NO:1849, etc. Each DNA sequence in
Table 2 is uniquely identified by a number that is 1847 less than its SEQ ID NO in the
Sequence Listing. Tables 1 and 2 provide: 1) the SEQ ID NO assigned to each sequence
for use in the present specification or a corresponding number; 2) the sequence name used
as an internal identifier of the sequence; 3) the name assigned to the clone from which the
sequence was isolated; and 4) the number of the cluster to which the sequence is assigned
(Cluster ID; where the cluster ID is 0, the sequence was not assigned to any cluster).
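
The offset between the table-local numbering (1-1504) and the Sequence Listing numbering (SEQ ID NOs:1848-3351) is a fixed 1847. A minimal sketch of the conversion in both directions follows; the function and constant names are illustrative only.

```python
TABLE_OFFSET = 1847        # SEQ ID NO minus table-local number
TABLE_ENTRIES = 1504       # table-local numbers run from 1 to 1504


def table_number_to_seq_id_no(table_number):
    """Convert a table-local number (1-1504) to its SEQ ID NO (1848-3351)."""
    if not 1 <= table_number <= TABLE_ENTRIES:
        raise ValueError(f"table number out of range: {table_number}")
    return table_number + TABLE_OFFSET


def seq_id_no_to_table_number(seq_id_no):
    """Convert a SEQ ID NO (1848-3351) to its table-local number (1-1504)."""
    low, high = 1 + TABLE_OFFSET, TABLE_ENTRIES + TABLE_OFFSET
    if not low <= seq_id_no <= high:
        raise ValueError(f"SEQ ID NO out of range: {seq_id_no}")
    return seq_id_no - TABLE_OFFSET
```

For example, table-local number 1 maps to SEQ ID NO:1848 and number 2 to SEQ ID NO:1849, as stated in the text.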

Because the provided polynucleotides represent partial mRNA
transcripts, two or more polynucleotides of the invention may represent different
regions of the same mRNA transcript and the same gene. Thus, if two or more SEQ ID
NOs: are identified as belonging to the same clone, then either sequence can be used to
obtain the full-length mRNA or gene.

EXAMPLE 2 

Results of Public Database Search to Identify Function of Gene Products 

SEQ ID NOs:1-3351 were translated in all three reading frames to
determine the best alignment with the individual sequences. These amino acid
sequences and nucleotide sequences are referred to, generally, as query sequences,
which are aligned with the individual sequences. Query and individual sequences were
aligned using the BLAST programs, available over the world wide web at
http://www.ncbi.nlm.nih.gov/BLAST/. Again the sequences were masked to various
extents to prevent searching of repetitive sequences or poly-A sequences, using the
XBLAST program for masking low complexity as described above in Example 1.
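
For readers unfamiliar with the step, translating a nucleotide sequence in all three forward reading frames can be sketched as follows. This is a generic illustration using the standard genetic code, not the BLAST implementation; the function names are hypothetical, and codons containing ambiguous bases are rendered as "X" by assumption.

```python
BASES = "TCAG"
# Standard genetic code, ordered with T, C, A, G at each codon position.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq, frame=0):
    """Translate seq starting at offset frame (0, 1 or 2); '*' marks a stop."""
    seq = seq.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    # Assumption: any codon not in the table (e.g. containing N) becomes 'X'.
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translation(seq):
    """Return the translations of the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


frames = three_frame_translation("ATGGCCTAA")
```

Each of the three protein strings can then be compared against a protein database, and the frame giving the best alignment is the one reported.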

Tables 3 and 4 (inserted before the claims) show the results of the
alignments. Table 3 contains alignment information for SEQ ID NOs:1-1847 and Table 4
contains alignment information for SEQ ID NOs:1848-3351. The DNA sequences of Table
4, while numbered SEQ ID 1-1504, correspond to SEQ ID NOs:1848-3351. Each DNA
sequence in Table 4 is uniquely identified by a number that is 1847 less than its SEQ ID
NO. Tables 3 and 4 refer to each sequence by its SEQ ID NO or a corresponding number,
the accession numbers and descriptions of nearest neighbors from the Genbank and
Non-Redundant Protein searches, and the p values of the search results.

For each of SEQ ID NOs:1-1847, the best alignment to a protein or DNA
sequence is included in Table 3, and the best alignment for each of SEQ ID NOs:1848-3351
is included in Table 4. The activity of the polypeptide encoded by SEQ ID
NOs:1-3351 is the same as or similar to that of the nearest neighbor reported in Table 3 or 4. The
accession number of the nearest neighbor is reported, providing a reference to the activities
exhibited by the nearest neighbor. The search program and database used for the alignment
also are indicated, as well as a calculation of the p value.

Full length sequences or fragments of the polynucleotide sequences of
the nearest neighbors can be used as probes and primers to identify and isolate the full
length sequence of SEQ ID NOs:1-3351. The nearest neighbors can indicate a tissue or
cell type to be used to construct a library for the full-length sequences of SEQ ID
NOs:1-3351.



EXAMPLE 3 

Members of Protein Families 



The sequences (SEQ ID NOs:1-3351) were used to conduct a profile
search as described in the specification above. Several of the polynucleotides of the
invention were found to encode polypeptides having characteristics of a polypeptide
belonging to known protein families (and thus represent new members of these
protein families) and/or comprising a known functional domain (Table 5). "Start" and
"stop" in Table 3 indicate the position within the individual sequences that align with
the query sequence having the indicated SEQ ID NO. The direction indicates the
orientation of the query sequence with respect to the individual sequence, where
forward (for) indicates that the alignment is in the same direction (left to right) as the
sequence provided in the Sequence Listing and reverse (rev) indicates that the
alignment is with a sequence complementary to the sequence provided in the Sequence
Listing.
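
To make the forward/reverse convention concrete, the sketch below returns the strand that an alignment refers to. It assumes the direction is recorded as the strings "for" and "rev" as abbreviated above; the function names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def aligned_strand(listed_sequence, direction):
    """Return the strand an alignment refers to.

    "for" means the sequence as given in the Sequence Listing;
    "rev" means its complement, read left to right (reverse complement).
    """
    if direction == "for":
        return listed_sequence
    if direction == "rev":
        return reverse_complement(listed_sequence)
    raise ValueError(f"unknown direction: {direction!r}")
```

Applying `reverse_complement` twice returns the original sequence, which is a quick sanity check when handling "rev" entries.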

Some polynucleotides exhibited multiple profile hits because, for 
example, the particular sequence contains overlapping profile regions, and/or the 
sequence contains two different functional domains. These profile hits are described in 
more detail below. 

Ank Repeats (ANK). SEQ ID NOs:187, 1268, 1804, 1819, 1830, 1839,
2652, 3015 and 3267 represent polynucleotides encoding an Ank repeat-containing
protein. The ankyrin motif is a 33 amino acid sequence named for the protein ankyrin,
which has 24 tandem 33-amino-acid motifs. Ank repeats were originally identified in
the cell-cycle-control protein cdc10 (Breeden et al., Nature (1987) 329:651). Proteins
containing ankyrin repeats include ankyrin, myotrophin, I-kappaB proteins, cell cycle
protein cdc10, the Notch receptor (Matsuno et al., Development (1997) 124(21):4265);
G9a (or BAT8) of the class III region of the major histocompatibility complex
(Biochem. J. 290:811-818, 1993), FABP, GABP, 53BP2, Lin12, glp-1, SWI4, and
SWI6. The functions of the ankyrin repeats are compatible with a role in
protein-protein interactions (Bork, Proteins (1993) 17(4):363; Lambert and Bennet, Eur. J.
Biochem. (1993) 211:1; Kerr et al., Current Op. Cell Biol. (1992) 4:496; Bennet et al.,
J. Biol. Chem. (1980) 255:6424).



ATPases Associated with Various Cellular Activities (ATPases).
Sequences within SEQ ID NOs:431, 639, 2135, 2684, 2859, 3197 and 3266 correspond
to a sequence that encodes a novel member of the "ATPases Associated with diverse
cellular Activities" (AAA) protein family. The AAA protein family is composed of a
large number of ATPases that share a conserved region of about 220 amino acids that
contains an ATP-binding site (Froehlich et al., J. Cell Biol. (1991) 114:443; Erdmann et
al., Cell (1991) 64:499; Peters et al., EMBO J. (1990) 9:1757; Kunau et al., Biochimie
(1993) 75:209-224; Confalonieri et al., BioEssays (1995) 17:639;
http://yeamob.pci.chemie.uni-tuebingen.de/AAA/Description.html). The proteins that
belong to this family contain either one or two AAA domains. In general, the AAA
domains in these proteins act as ATP-dependent protein clamps (Confalonieri et al.
(1995) BioEssays 17:639). In addition to the ATP-binding 'A' and 'B' motifs, which are
located in the N-terminal half of this domain, there is a highly conserved region located
in the central part of the domain which was used in the development of the signature
pattern. The consensus pattern is: [LIVMT]-x-[LIVMT]-[LIVMF]-x-[GATMC]-[ST]-
[NS]-x(4)-[LIVM]-D-x-A-[LIFA]-x-R.

Bromodomain (bromodomain). SEQ ID NO:1814 represents a
polynucleotide encoding a polypeptide having a bromodomain region (Haynes et al.,
1992, Nucleic Acids Res. 20:2603, Tamkun et al., 1992, Cell 68:561-572, and
Tamkun, 1995, Curr. Opin. Genet. Dev. 5:473-477), which is a conserved region of
about 70 amino acids. The bromodomain is thought to be involved in protein-protein
interactions and may be important for the assembly or activity of multicomponent
complexes involved in transcriptional activation. The consensus pattern, which spans a
major part of the bromodomain, is: [STANVF]-x(2)-F-x(4)-[DNS]-x(5,7)-[DENQTF]-
Y-[HFY]-x(2)-[LIVMFY]-x(3)-[LIVM]-x(4)-[LIVM]-x(6,8)-Y-x(12,13)-[LIVM]-x(2)-
N-[SACF]-x(2)-[FY].

Basic Region Plus Leucine Zipper Transcription Factors (BZIP). SEQ
ID NOs:410, 552, 768, 822, 836, 1288, 1365, 1454, 1540, 1549, 1556, 1557, 1563,
1622, 1630, 1704, 1808, 2363, 2424, 3147, 3152, 3158 and 3208 represent
polynucleotides encoding a novel member of the family of basic region plus leucine
zipper transcription factors. The bZIP superfamily (Hurst, Protein Prof. (1995) 2:105;
and Ellenberger, Curr. Opin. Struct. Biol. (1994) 4:12) of eukaryotic DNA-binding
transcription factors encompasses proteins that contain a basic region mediating
sequence-specific DNA-binding followed by a leucine zipper required for dimerization.
The consensus pattern for this protein family is: [KR]-x(1,3)-[RKSAQ]-N-x(2)-
[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK].
EF Hand (EFhand). SEQ ID NOs:820, 1755 and 3285 correspond to
polynucleotides encoding a novel protein in the family of EF-hand proteins. Many
calcium-binding proteins belong to the same evolutionary family and share a type of
calcium-binding domain known as the EF-hand (Kawasaki et al., Protein. Prof. (1995)
2:305-490). This type of domain consists of a twelve residue loop flanked on both sides
by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is
coordinated in a pentagonal bipyramidal configuration. The six residues involved in the
binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y,
-X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding
Ca (bidentate ligand). The consensus pattern includes the complete EF-hand loop as
well as the first residue which follows the loop and which seems to always be
hydrophobic: D-x-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}-[LIVMC]-
[DENQSTAGC]-x(2)-[DE]-[LIVMFYW].
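
The consensus patterns quoted in this Example use a compact notation: `x` is any residue, `[...]` lists allowed residues, `{...}` lists excluded residues, and `(n)` or `(n,m)` gives a repeat count. One way to scan a protein sequence for such a pattern is to convert it to a regular expression. The converter below is a minimal sketch covering only the notation that appears above; it is not the profile search tool used in the specification, and the test peptide is an illustrative sequence chosen to satisfy the EF-hand pattern.

```python
import re

# One pattern element: a core (x, [..], {..} or a residue) plus optional (n) or (n,m).
_ELEMENT = re.compile(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?")


def pattern_to_regex(pattern):
    """Convert a hyphen-separated consensus pattern into a regex string."""
    parts = []
    for element in pattern.strip().rstrip(".").split("-"):
        element = element.strip()
        if not element:
            continue
        core, low, high = _ELEMENT.fullmatch(element).groups()
        if core == "x":
            part = "."                               # any residue
        elif core.startswith("[") and core.endswith("]"):
            part = core                              # allowed residues
        elif core.startswith("{") and core.endswith("}"):
            part = "[^" + core[1:-1] + "]"           # excluded residues
        elif len(core) == 1 and core.isalpha():
            part = core                              # one fixed residue
        else:
            raise ValueError(f"unsupported pattern element: {element!r}")
        if low is not None:
            part += "{" + low + ("," + high if high else "") + "}"
        parts.append(part)
    return "".join(parts)


EF_HAND = ("D-x-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}-[LIVMC]-"
           "[DENQSTAGC]-x(2)-[DE]-[LIVMFYW]")
BZIP = "[KR]-x(1,3)-[RKSAQ]-N-x(2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK]"

ef_regex = pattern_to_regex(EF_HAND)
match = re.search(ef_regex, "AEFKEAFSLFDKDGDGTITTKELGTVMRSL")
```

Here `match.group()` is the 13-residue stretch that satisfies the EF-hand pattern. A pattern match of this kind only flags a candidate; it does not by itself establish family membership.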

Ets Domain (Ets Nterm). SEQ ID NO:1811 represents a polynucleotide
encoding a polypeptide with N-terminal homology in the ETS domain. Proteins of this
family contain a conserved domain, the "ETS-domain," that is involved in DNA
binding. The domain appears to recognize purine-rich sequences; it is about 85 to 90
amino acids in length, and is rich in aromatic and positively charged residues (Wasylyk,
et al., Eur. J. Biochem. (1993) 211:7-18). The ets gene family encodes a novel class of
DNA-binding proteins, each of which binds a specific DNA sequence and comprises an
ets domain that specifically interacts with sequences containing the common core
tri-nucleotide sequence GGA. In addition to an ets domain, native ets proteins comprise
other sequences which can modulate the biological specificity of the protein. Ets genes
and proteins are involved in a variety of essential biological processes including cell
growth, differentiation and development, and three members are implicated in
oncogenic process.

G-Protein Alpha Subunit (G-alpha). SEQ ID NO:1846 represents a
polynucleotide encoding a novel polypeptide of the G-protein alpha subunit family.
Guanine nucleotide binding proteins (G-proteins) are a family of membrane-associated
proteins that couple extracellularly-activated integral-membrane receptors to
intracellular effectors, such as ion channels and enzymes that vary the concentration of
second messenger molecules. G-proteins are composed of 3 subunits (alpha, beta and
gamma) which, in the resting state, associate as a trimer at the inner face of the plasma
membrane. The alpha subunit binds GTP and exhibits GTPase activity. G-protein alpha
subunits are 350-400 amino acids in length and have molecular weights in the range
40-45 kDa. Seventeen distinct types of alpha subunit have been identified in mammals,
and fall into 4 main groups on the basis of both sequence similarity and function:
alpha-s, alpha-q, alpha-i and alpha-12 (Simon et al., Science (1993) 252:802). They are often
N-terminally acylated, usually with myristate and/or palmitate, and these fatty acid
modifications can be important for membrane association and high-affinity interactions
with other proteins.

Helicases conserved C-terminal domain (helicase C). SEQ ID
NOs:1496, 2826 and 2871 represent polynucleotides encoding novel members of the
DEAD/H helicase family. A number of eukaryotic and prokaryotic proteins have been
characterized (Schmid S.R., et al., Mol. Microbiol. (1992) 6:283; Linder P., et al.,
Nature (1989) 337:121; Wassarman D.A., et al., Nature (1991) 349:463) on the basis of
their structural similarity. All are involved in ATP-dependent, nucleic-acid unwinding.
All DEAD box family members of the above proteins share a number of conserved
sequence motifs, some of which are specific to the DEAD family while others are
shared by other ATP-binding proteins or by proteins belonging to the helicases
'superfamily' (Hodgman T.C., Nature (1988) 333:22 and Nature (1988) 333:578
(Errata)). One of these motifs, called the "D-E-A-D-box", represents a special version of
the B motif of ATP-binding proteins. Some other proteins belong to a subfamily which
has His instead of the second Asp and are thus said to be "D-E-A-H-box" proteins
(Wassarman D.A., et al., Nature (1991) 349:463; Harosh I., et al., Nucleic Acids Res.
(1991) 19:6331; Koonin E.V. et al., J. Gen. Virol. (1992) 73:989). The following
signature patterns are used to identify members of both subfamilies: 1) [LIVMF](2)-D-
E-A-D-[RKEN]-x-[LIVMFYGSTN]; and 2) [GSAH]-x-[LIVMF](3)-D-E-[ALIV]-H-
[NECR].

Homeobox domain (homeobox). SEQ ID NOs:1676, 1820 and 1821
represent polynucleotides encoding proteins having a homeobox domain. The
homeobox is a protein domain of 60 amino acids (Gehring In: Guidebook to the
Homeobox Genes, Duboule D., Ed., pp. 1-10, Oxford University Press, Oxford, (1994);
Buerglin In: Guidebook to the Homeobox Genes, pp. 25-72, Oxford University Press,
Oxford, (1994); Gehring, Trends Biochem. Sci. (1992) 17:277-280; Gehring et al.,
Annu. Rev. Genet. (1986) 20:147-173; Schofield, Trends Neurosci. (1987) 10:3-6) first
identified in a number of Drosophila homeotic and segmentation proteins. It is
extremely well conserved in many other animals, including vertebrates. This domain
binds DNA through a helix-turn-helix type of structure. Several proteins that contain a
homeobox domain play an important role in development. Most of these proteins are
sequence-specific DNA-binding transcription factors. The homeobox domain is also
very similar to a region of the yeast mating type proteins. These are sequence-specific
DNA-binding proteins that act as master switches in yeast differentiation by controlling
gene expression in a cell type-specific fashion.

A schematic representation of the homeobox domain is shown below.
The helix-turn-helix region is shown by the symbols 'H' (for helix), and 't' (for turn).

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHtttHHHHHHHHHxxxxxxxxxx
1                                                         60

The pattern used to detect homeobox sequences is 24 residues long and spans
positions 34 to 57 of the homeobox domain. The consensus pattern is as follows:
[LIVMFYG]-[ASLVR]-x(2)-[LIVMSTACN]-x-[LIVM]-x(4)-[LIV]-[RKNQESTAIY]-
[LIVFSTNKH]-W-[FYVC]-x-[NDQTAH]-x(5)-[RKNAIMW].
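
Because the pattern covers positions 34 to 57 of a 60-residue domain, the
position of a pattern hit can be used to estimate where the whole homeobox
lies in a protein. The following is a minimal sketch of that arithmetic; the
constant and function names are hypothetical, and the estimate assumes no
insertions or deletions inside the domain.

```python
PATTERN_FIRST_POS = 34  # domain position of the first pattern residue (from the text)
PATTERN_LAST_POS = 57   # domain position of the last pattern residue (from the text)
DOMAIN_LENGTH = 60      # length of the homeobox domain (from the text)

# The pattern length follows from the two positions: 57 - 34 + 1 = 24.
PATTERN_LENGTH = PATTERN_LAST_POS - PATTERN_FIRST_POS + 1


def homeobox_bounds(match_start, seq_length):
    """Estimate the 1-based (start, end) of the homeobox domain.

    match_start is the 1-based position of the first residue matched by the
    pattern. Returns None when the implied domain would run off either end
    of the protein.
    """
    start = match_start - (PATTERN_FIRST_POS - 1)
    end = start + DOMAIN_LENGTH - 1
    if start < 1 or end > seq_length:
        return None
    return start, end


print(homeobox_bounds(41, 100))  # (8, 67)
```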

MAP kinase kinase (mkk). SEQ ID NOs:29, 31, 196, 3175, 3190 and
3281 represent novel members of the MAP kinase kinase family. MAP kinases
(MAPK) are involved in signal transduction, and are important in cell cycle and cell
growth controls. The MAP kinase kinases (MAPKK) are dual-specificity protein
kinases which phosphorylate and activate MAP kinases. MAPKK homologues have
been found in yeast, invertebrates, amphibians, and mammals. Moreover, the
MAPKK/MAPK phosphorylation switch constitutes a basic module activated in distinct
pathways in yeast and in vertebrates. MAPKKs are essential transducers through which
signals must pass before reaching the nucleus. For review, see, e.g., Biologique Biol
Cell (1993) 79:193-207; Nishida et al., Trends Biochem Sci (1993) 18:128-31;
Ruderman, Curr Opin Cell Biol (1993) 5:207-13; Dhanasekaran et al., Oncogene (1998)
17:1447-55; Kiefer et al., Biochem Soc Trans (1997) 25:491-8; and Hill, Cell Signal
(1996) 8:533-44.

Protein Kinase (protkinase). SEQ ID NOs:1157, 1478, 1496, 2286, 2969
and 3190 represent polynucleotides encoding protein kinases. Protein kinases catalyze
phosphorylation of proteins in a variety of pathways, and are implicated in cancer.
Eukaryotic protein kinases (Hanks S.K., et al., FASEB J. (1995) 9:576; Hunter T., Meth.
Enzymol. (1991) 200:3; Hanks S.K., et al., Meth. Enzymol. (1991) 200:38; Hanks S.K.,
Curr. Opin. Struct. Biol. (1991) 1:369; Hanks S.K. et al., Science (1988) 241:42) are
enzymes that belong to a very extensive family of proteins which share a conserved
catalytic core common to both serine/threonine and tyrosine protein kinases. There are
a number of conserved regions in the catalytic domain of protein kinases. The first
region, which is located in the N-terminal extremity of the catalytic domain, is a
glycine-rich stretch of residues in the vicinity of a lysine residue, which has been shown
to be involved in ATP binding. The second region, which is located in the central part
of the catalytic domain, contains a conserved aspartic acid residue which is important
for the catalytic activity of the enzyme (Knighton D.R. et al., Science (1991) 253:407).
The protein kinase profile includes two signature patterns for this second region: one
specific for serine/threonine kinases and the other for tyrosine kinases. A third profile
is based on the alignment in Hanks S.K. et al., FASEB J. (1995) 9:576, and covers the
entire catalytic domain.

The consensus patterns are as follows:
1) [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}-[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-
[LIVMFYWCSTAR]-[AIVP]-[LIVMFAGCKR]-K, where K binds ATP;
2) [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-[LIVMFYCT](3), where D is an active
site residue; and
3) [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-[RSTAC]-x(2)-N-[LIVMFYC], where D is an active
site residue.

If a protein analyzed includes two of the above protein kinase signatures,
the probability of it being a protein kinase is close to 100%.
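
The two-signature rule stated above can be sketched as a small check that counts
how many of the three patterns occur in a sequence. The regular expressions are
hand translations of the three consensus patterns as printed above; the dictionary
keys, function names and the test sequence are hypothetical and serve only as an
illustration.

```python
import re

# Hand-translated versions of the three consensus patterns listed above.
KINASE_SIGNATURES = {
    "atp_binding": re.compile(
        r"[LIV]G[^P]G[^P][FYWMGSTNH][SGA][^PW][LIVCAT][^PD]."
        r"[GSTACLIVMFY].{5,18}[LIVMFYWCSTAR][AIVP][LIVMFAGCKR]K"
    ),
    "ser_thr_active_site": re.compile(r"[LIVMFYC].[HY].D[LIVMFY]K.{2}N[LIVMFYCT]{3}"),
    "tyr_active_site": re.compile(r"[LIVMFYC].[HY].D[LIVMFY][RSTAC].{2}N[LIVMFYC]"),
}


def matched_signatures(sequence):
    """Return the names of the signatures found anywhere in the sequence."""
    return [name for name, regex in KINASE_SIGNATURES.items() if regex.search(sequence)]


def looks_like_kinase(sequence):
    """Apply the rule from the text: at least two signatures present."""
    return len(matched_signatures(sequence)) >= 2


# Made-up sequence: an ATP-binding-like stretch, a spacer, and a
# serine/threonine active-site-like stretch.
example = "LGTGSFGRVMLVKHKETGNHYAMKIL" + "AAAA" + "LIYRDLKPENLLI"
print(matched_signatures(example))  # ['atp_binding', 'ser_thr_active_site']
print(looks_like_kinase(example))   # True
```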

Ras family proteins (ras). SEQ ID NOs:1688 and 3258 represent
polynucleotides encoding novel members of the ras family of small GTP/GDP-binding
proteins (Valencia et al., 1991, Biochemistry 30:4637-4648). Ras family members
generally require a specific guanine nucleotide exchange factor (GEF) and a specific
GTPase activating protein (GAP) as stimulators of overall GTPase activity. Among
ras-related proteins, the highest degree of sequence conservation is found in four
regions that are directly involved in guanine nucleotide binding. The first two
constitute most of the phosphate and Mg2+ binding site (PM site) and are located in the
first half of the G-domain. The other two regions are involved in guanosine binding and
are located in the C-terminal half of the molecule. Motifs and conserved structural
features of the ras-related proteins are described in Valencia et al., 1991, Biochemistry
30:4637-4648. A major consensus pattern of ras proteins is: D-T-A-G-Q-E-K-[LF]-G-
G-L-R-[DE]-G-Y-Y.

Thioredoxin family active site (Thioredox). SEQ ID NO:1677 represents
a polynucleotide encoding a protein having a thioredoxin family active site.
Thioredoxins (Holmgren A., Annu. Rev. Biochem. (1985) 54:237; Gleason F.K. et al.,
FEMS Microbiol. Rev. (1988) 54:271; Holmgren A., J. Biol. Chem. (1989) 264:13963;
Eklund H. et al., Proteins (1991) 11:13) are small proteins of approximately one
hundred amino-acid residues which participate in various redox reactions via the
reversible oxidation of an active center disulfide bond. They exist in either a reduced
form or an oxidized form where the two cysteine residues are linked in an
intramolecular disulfide bond. Thioredoxin is present in prokaryotes and eukaryotes
and the sequence around the redox-active disulfide bond is well conserved. All protein
disulfide isomerases (PDI) contain two or three (ERp72) copies of the thioredoxin
domain. The consensus pattern is: [LIVMF]-[LIVMSTA]-x-[LIVMFYC]-[FYWSTHE]-x(2)-
[FYWGTN]-C-[GATPLVE]-[PHYWSTA]-C-x(6)-[LIVMFYWT] (where the two C's form the
redox-active bond).

Trypsin (trypsin). SEQ ID NO:1410 corresponds to a novel serine
protease of the trypsin family. The catalytic activity of the serine proteases from the
trypsin family is provided by a charge relay system involving an aspartic acid residue
hydrogen-bonded to a histidine, which itself is hydrogen-bonded to a serine. The
sequences in the vicinity of the active site serine and histidine residues are well
conserved in this family of proteases (Brenner S., Nature (1988) 334:528). The
consensus patterns for this trypsin protein family are: 1) [LIVM]-[ST]-A-[STAG]-H-C,
where H is the active site residue; and 2) [DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-
S-G-[GS]-[SAPHV]-[LIVMFYWH]-[LIVMFYSTANQH], where S is the active site
residue. All sequences known to belong to this family are detected by the above
consensus sequences, except for 18 different proteases which have lost the first
conserved glycine. If a protein includes both the serine and the histidine active site
signatures, the probability of it being a trypsin family serine protease is 100%.

WD Domain, G-Beta Repeats (WD domain). SEQ ID NOs:1336, 1380,
1711, 1762, 1909, 2218, 3047, 3108 and 3292 represent novel members of the WD
domain/G-beta repeat family. Beta-transducin (G-beta) is one of the three subunits
(alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G proteins) which
act as intermediaries in the transduction of signals generated by transmembrane
receptors (Gilman, Annu. Rev. Biochem. (1987) 56:615). The alpha subunit binds to
and hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but
they seem to be required for the replacement of GDP by GTP as well as for membrane
anchoring and receptor recognition. In higher eukaryotes, G-beta exists as a small
multigene family of highly conserved proteins of about 340 amino acid residues.
Structurally, G-beta consists of eight tandem repeats of about 40 residues, each
containing a central Trp-Asp motif (this type of repeat is sometimes called a WD-40
repeat). The consensus pattern for the WD domain/G-Beta repeat family is:
[LIVMSTAC]-[LIVMFYWSTAGC]-[LIMSTAG]-[LIVMSTAGC]-x(2)-[DN]-x(2)-
[LIVMWSTAC]-x-[LIVMFSTAG]-W-[DEN]-[LIVMFSTAGCN].
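
Since G-beta is described above as a series of tandem repeats, one simple use of
the consensus pattern is to list every place it occurs in a sequence and count
the hits. The sketch below is illustrative only; the regular expression is a hand
translation of the pattern above, and the function name and the test sequence
are hypothetical.

```python
import re

# Hand translation of the WD domain/G-beta repeat consensus pattern above.
WD_REPEAT = re.compile(
    r"[LIVMSTAC][LIVMFYWSTAGC][LIMSTAG][LIVMSTAGC].{2}[DN].{2}"
    r"[LIVMWSTAC].[LIVMFSTAG]W[DEN][LIVMFSTAGCN]"
)


def wd_repeat_starts(sequence):
    """Return the 1-based start of every non-overlapping pattern hit."""
    return [m.start() + 1 for m in WD_REPEAT.finditer(sequence)]


# Made-up sequence containing the same 15-residue motif twice.
motif = "LASGSSDGTIRLWDL"
example = "PP" + motif + "PPPP" + motif + "PP"
print(wd_repeat_starts(example))       # [3, 22]
print(len(wd_repeat_starts(example)))  # 2
```

The pattern covers only the conserved core around the Trp-Asp dipeptide, so the
number of hits is a lower bound on the number of repeats when some copies have
diverged.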

wnt Family of Developmental Signaling Proteins (Wnt dev sign). SEQ
ID NO:1538 corresponds to a novel member of the wnt family of developmental
signaling proteins. Wnt-1 (previously known as int-1), the seminal member of this
family (Nusse R., Trends Genet. (1988) 4:291), is thought to play a role in intercellular
communication and seems to be a signalling molecule important in the development of
the central nervous system (CNS). All wnt family proteins share the following features
characteristic of secretory proteins: a signal peptide, several potential N-glycosylation
sites and 22 conserved cysteines that are probably involved in disulfide bonds. The
Wnt proteins seem to adhere to the plasma membrane of the secreting cells and are
therefore likely to signal over only a few cell diameters. The consensus pattern, which is
based upon a highly conserved region including three cysteines, is as follows: C-K-C-
H-G-[LIVMT]-S-G-x-C.

Protein Tyrosine Phosphatase (Y phosphatase). SEQ ID NO:1417
represents a polynucleotide encoding a protein tyrosine phosphatase. Tyrosine specific
protein phosphatases (EC 3.1.3.48) (PTPase) (Fischer et al., Science (1991) 253:401;
Charbonneau et al., Annu. Rev. Cell Biol. (1992) 8:463; Trowbridge, J. Biol. Chem.
(1991) 266:23517; Tonks et al., Trends Biochem. Sci. (1989) 14:497; and Hunter, Cell
(1989) 58:1013) catalyze the removal of a phosphate group attached to a tyrosine
residue. These enzymes are very important in the control of cell growth, proliferation,
differentiation and transformation. Multiple forms of PTPase have been characterized
and can be classified into two categories: soluble PTPases and transmembrane receptor
proteins that contain PTPase domain(s). Structurally, all known receptor PTPases are
made up of a variable length extracellular domain, followed by a transmembrane region
and a C-terminal catalytic cytoplasmic domain. PTPase domains consist of about 300
amino acids. There are two conserved cysteines, the second of which has been shown
to be absolutely required for activity. Furthermore, a number of conserved residues in
its immediate vicinity have also been shown to be important. The consensus pattern for
PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-[STC]-[STAGP]-x-[LIVMFY]; C is the active site
residue.

Zinc Finger, C2H2 Type (Zincfing C2H2). SEQ ID NOs:308, 807,
1324, 1503, 1527, 3081, 3193 and 3306 correspond to polynucleotides encoding novel
members of the C2H2 type zinc finger protein family. Zinc finger domains (Klug
et al., Trends Biochem. Sci. (1987) 12:464; Evans et al., Cell (1988) 52:1; Payre et al.,
FEBS Lett. (1988) 234:245; Miller et al., EMBO J. (1985) 4:1609; and Berg, Proc. Natl.
Acad. Sci. USA (1988) 85:99) are nucleic acid-binding protein structures. In addition to
the conserved zinc ligand residues, it has been shown that a number of other positions
are also important for the structural integrity of the C2H2 zinc fingers (Rosenfeld et al.,
J. Biomol. Struct. Dyn. (1993) 11:557). The best conserved position is found four
residues after the second cysteine; it is generally an aromatic or aliphatic residue. The
consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-
x(3,5)-H. The two C's and two H's are zinc ligands.
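
To make the roles of the individual residues concrete, the sketch below scans a
sequence with a hand-translated version of the C2H2 pattern and reports the
positions of the four zinc ligands and of the conserved hydrophobic residue.
Capture groups are used so that each residue can be located separately. The
function name, the dictionary keys and the test sequence are hypothetical.

```python
import re

# Groups 1, 2, 4 and 5 are the zinc ligands (C, C, H, H); group 3 is the
# conserved hydrophobic position four residues after the second cysteine.
C2H2 = re.compile(r"(C).{2,4}(C).{3}([LIVMFYWC]).{8}(H).{3,5}(H)")


def zinc_fingers(sequence):
    """Return one record per non-overlapping C2H2 pattern hit (1-based positions)."""
    fingers = []
    for m in C2H2.finditer(sequence):
        fingers.append({
            "ligands": tuple(m.start(g) + 1 for g in (1, 2, 4, 5)),
            "hydrophobic": (m.start(3) + 1, m.group(3)),
        })
    return fingers


# Made-up finger-like sequence used only to exercise the code.
print(zinc_fingers("YKCPECGKSFSQKSNLQKHQRTH"))
# [{'ligands': (3, 6, 19, 23), 'hydrophobic': (10, 'F')}]
```

In the example the hydrophobic residue sits at position 10, four residues after
the second cysteine at position 6, as the text describes.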

Src homology 2. SEQ ID NOs:186, 2591, 3307 and 3339 represent
polynucleotides encoding novel members of the family of Src homology 2 (SH2)
proteins. The Src homology 2 (SH2) domain is a protein domain of about 100 amino
acid residues first identified as a conserved sequence region between the oncoproteins
Src and Fps (Sadowski I. et al., Mol. Cell. Biol. 6:4396-4408 (1986)). Similar sequences
are found in many other intracellular signal-transducing proteins (Russel R.B. et al.,
FEBS Lett. 304:15-20 (1992)). SH2 domains function as regulatory modules of
intracellular signalling cascades by interacting with high affinity with phosphotyrosine-
containing target peptides in a sequence-specific and phosphorylation-dependent
manner (Marengere L.E.M., Pawson T., J. Cell Sci. Suppl. 18:97-104 (1994); Pawson
T., Schlessinger J., Curr. Biol. 3:434-442 (1993); Mayer B.J., Baltimore D., Trends
Cell Biol. 3:8-13 (1993); Pawson T., Nature 373:573-580 (1995)).

The SH2 domain has a conserved 3D structure consisting of two alpha
helices and six to seven beta-strands. The core of the domain is formed by a continuous
beta-meander composed of two connected beta-sheets (Kuriyan J., Cowburn D., Curr.
Opin. Struct. Biol. 3:828-837 (1993)). The profile to detect SH2 domains is based on a
structural alignment consisting of 8 gap-free blocks and 7 linker regions totaling 92
match positions.

Src homology 3. SEQ ID NOs:234, 1832, and 1835 represent
polynucleotides encoding novel members of the family of Src homology 3 (SH3)
proteins. The Src homology 3 (SH3) domain is a small protein domain of about 60
amino acid residues first identified as a conserved sequence in the non-catalytic part of
several cytoplasmic protein tyrosine kinases (e.g., Src, Abl, Lck) (Mayer B.J. et al.,
Nature 332:272-275 (1988)). Since then, it has been found in a great variety of other
intracellular or membrane-associated proteins (Musacchio A. et al., FEBS Lett. 307:55-
61 (1992); Pawson T., Schlessinger J., Curr. Biol. 3:434-442 (1993); Mayer B.J.,
Baltimore D., Trends Cell Biol. 3:8-13 (1993); Pawson T., Nature 373:573-580 (1995)).

The SH3 domain has a characteristic fold which consists of five or six
beta strands arranged as two tightly packed anti-parallel beta sheets. The linker regions
may contain short helices (Kuriyan J., Cowburn D., Curr. Opin. Struct. Biol. 3:828-837
(1993)).

The function of the SH3 domain may be to mediate assembly of specific
protein complexes via binding to proline-rich peptides (Morton C.J., Campbell I.D.,
Curr. Biol. 4:615-617 (1994)).

In general, SH3 domains are found as single copies in a given protein, but
there are a significant number of proteins with two SH3 domains and a few with 3 or 4
copies.

Fibronectin type III. SEQ ID NOs:746 and 1192 represent
polynucleotides encoding novel members of the family of fibronectin type III proteins.
A number of receptors for lymphokines, hematopoietic growth factors and growth
hormone-related molecules have been found to share a common binding domain
(Bazan J.F., Biochem. Biophys. Res. Commun. 164:788-795 (1989); Bazan J.F., Proc.
Natl. Acad. Sci. U.S.A. 87:6934-6938 (1990); Cosman D. et al., Trends Biochem. Sci.
15:265-270 (1990); D'Andrea A.D., Fasman G.D., Lodish H.F., Cell 58:1023-1024
(1989); D'Andrea A.D., Fasman G.D., Lodish H.F., Curr. Opin. Cell Biol. 2:648-651
(1990)).

The conserved region constitutes all or part of the extracellular ligand-
binding region and is about 200 amino acid residues long. In the N-terminal part of this
domain there are two pairs of cysteines known, in the growth hormone receptor, to be
involved in disulfide bonds.

+-----------------------------------+ xxxxxxx +-------------+
| C C     C  C       Extracellular  | XXXXXXX | Cytoplasmic |
+-|-|-----|--|----------------------+ xxxxxxx +-------------+
  | |     |  |                     Transmembrane
  +-+     +--+

Two patterns detect this family of receptors. The first one is derived
from the first N-terminal disulfide loop; the second is a tryptophan-rich pattern located
at the C-terminal extremity of the extracellular region.

A consensus for this protein family is: C-[LVFYR]-x(7,8)-[STIVDN]-C-
x-W [the two C's are linked by a disulfide bond]. A second consensus for this protein
family is: [STGL]-x-W-[SG]-x-W-S.
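
Because the first pattern lies near the N-terminus of the conserved region and the
second at the C-terminal end of the extracellular region, a hit for both, in that
order, is more informative than either alone. The sketch below checks for both
patterns and their relative order. The regular expressions are hand translations
of the two consensus patterns; the function name, the dictionary keys and the
test sequence are hypothetical.

```python
import re

DISULFIDE_LOOP = re.compile(r"C[LVFYR].{7,8}[STIVDN]C.W")  # first consensus
TRP_RICH_BOX = re.compile(r"[STGL].W[SG].WS")              # second consensus


def receptor_signatures(sequence):
    """Locate the first hit of each pattern and test that the loop comes first."""
    loop = DISULFIDE_LOOP.search(sequence)
    box = TRP_RICH_BOX.search(sequence)
    return {
        "loop_start": loop.start() + 1 if loop else None,
        "box_start": box.start() + 1 if box else None,
        "expected_order": bool(loop and box and loop.start() < box.start()),
    }


# Made-up sequence: a disulfide-loop-like stretch followed by a W-S-x-W-S box.
example = "AA" + "CL" + "A" * 7 + "SCAW" + "A" * 5 + "TAWSAWS" + "AA"
print(receptor_signatures(example))
# {'loop_start': 3, 'box_start': 21, 'expected_order': True}
```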

LIM domain containing proteins. SEQ ID NOs:1269, 1309, 1360, and
1386 represent polynucleotides encoding novel members of the family of LIM domain
containing proteins. A number of proteins contain a conserved cysteine-rich domain of
about 60 amino-acid residues (Freyd G. et al., Nature 344:876-879 (1990); Baltz R. et
al., Plant Cell 4:1465-1466 (1992); Sanchez-Garcia I., Rabbitts T.H., Trends Genet.
10:315-320 (1994)).

In the LIM domain, there are seven conserved cysteine residues and a
histidine. The arrangement followed by these conserved residues is C-x(2)-C-x(16,23)-
H-x(2)-[CH]-x(2)-C-x(2)-C-x(16,21)-C-x(2,3)-[CHD]. The LIM domain binds two zinc
ions (Michelsen J.W. et al., Proc. Natl. Acad. Sci. U.S.A. 90:4404-4408 (1993)). The LIM
domain does not bind DNA; rather, it seems to act as an interface for protein-protein
interaction. The consensus for this protein family is: C-x(2)-C-x(15,21)-[FYWH]-H-
x(2)-[CH]-x(2)-C-x(2)-C-x(3)-[LIVMF]. The 5 C's and the H bind zinc.

C2 domain (protein kinase C like). SEQ ID NOs:1325 and 2282
represent polynucleotides encoding novel members of the family of C2 domain
containing proteins. Some isozymes of protein kinase C (PKC) contain a domain,
known as C2, of about 116 amino-acid residues, which is located between the two
copies of the C1 domain (that bind phorbol esters and diacylglycerol) and the protein
kinase catalytic domain (Azzi A. et al., Eur. J. Biochem. 205:547-557 (1992); Stabel S.,
Semin. Cancer Biol. 5:277-284 (1994)).

The C2 domain is involved in calcium-dependent phospholipid binding
(Davletov B.A., Suedhof T.C., J. Biol. Chem. 268:26386-26390 (1993)). Since
domains related to the C2 domain are also found in proteins that do not bind calcium,
other putative functions for the C2 domain include binding to inositol-1,3,4,5-
tetraphosphate (Fukuda M., et al., J. Biol. Chem. 269:29206-29211 (1994)).

The consensus pattern for the C2 domain is located in a conserved part
of that domain, the connecting loop between beta strands 2 and 3. The profile for the C2
domain covers the total domain. The consensus for this protein family is: [ACG]-x(2)-
L-x(2,3)-D-x(1,2)-[NGSTLIF]-[GTMR]-x-[STAP]-D-[PA]-[FY].

Serine proteases, trypsin family, active sites. SEQ ID NO:1410
represents a polynucleotide encoding a novel member of the trypsin family of serine
proteases. The catalytic activity of the serine proteases from the trypsin family is
provided by a charge relay system involving an aspartic acid residue hydrogen-bonded
to a histidine, which itself is hydrogen-bonded to a serine. The sequences in the vicinity
of the active site serine and histidine residues are well conserved in this family of
proteases (Brenner S., Nature 334:528-530 (1988)).

A consensus for this protein family is: [LIVM]-[ST]-A-[STAG]-H-C [H
is the active site residue]. A second consensus for this protein family is: [DNSTAGC]-
[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]-[LIVMFYWH]-
[LIVMFYSTANQH] [S is the active site residue].

RNA Recognition Motif Domain (RRM, RBD, or RNP). SEQ ID NOs:
1464 and 1514 represent polynucleotides encoding novel members of the family of
RNA recognition motif domain proteins (Bandziulis R.J. et al., Genes Dev. 3:431-437
(1989); Dreyfuss G. et al., Trends Biochem. Sci. 13:86-91 (1988)).

Inside the putative RNA-binding domain there are two regions which are
highly conserved. The first one is a hydrophobic segment of six residues (which is
called the RNP-2 motif); the second one is an octapeptide motif (which is called RNP-1
or RNP-CS). The position of both motifs in the domain is shown in the following
schematic representation:

xxxxxxx######xxxxxxxxxxxxxxxxxxxxxxxxxxxxx#######xxxxxxxxxxx
       RNP-2                              RNP-1

The RNP-1 motif was used as the consensus pattern for this type of domain.
The consensus for this protein family is: [RK]-G-{EDRKHPCG}-[AGSCI]-
[FY]-[LIVA]-x-[FYLM].

Phosphatidylinositol-specific phospholipase C, Y Domain. SEQ ID NO:
1707 represents a polynucleotide encoding a novel member of the phosphatidylinositol-
specific phospholipase C, Y domain family of proteins. Phosphatidylinositol-specific
phospholipase C (EC 3.1.4.11), a eukaryotic intracellular enzyme, plays an important
role in signal transduction processes (Meldrum E. et al., Biochim. Biophys. Acta
1092:49-71 (1991)). It catalyzes the hydrolysis of 1-phosphatidyl-D-myo-inositol
4,5-bisphosphate into the second messenger molecules diacylglycerol and inositol
1,4,5-trisphosphate. This catalytic process is tightly regulated by reversible
phosphorylation and binding of regulatory proteins (Rhee S.G., Choi K.D., Adv. Second
Messenger Phosphoprotein Res. 26:35-61 (1992); Rhee S.G., Choi K.D., J. Biol. Chem.
267:12393-12396 (1992); Sternweis P.C., Smrcka A.V., Trends Biochem. Sci. 17:502-
506 (1992)).

All eukaryotic PI-PLCs contain two regions of homology, referred to as
"X-box" and "Y-box". The order of these two regions is the same (NH2-X-Y-COOH),
but the spacing is variable. In most isoforms, the distance between these two regions is
only 50-100 residues, but in the gamma isoforms one PH domain, two SH2 domains,
and one SH3 domain are inserted between the two PLC-specific domains. The two
conserved regions have been shown to be important for the catalytic activity. At the C-
terminal end of the Y-box, there is a C2 domain possibly involved in Ca-dependent
membrane attachment.

Serine Carboxypeptidases. SEQ ID NO:1744 represents a
polynucleotide encoding a novel member of the serine carboxypeptidase family of
proteins. Carboxypeptidases may be either metallo carboxypeptidases or serine
carboxypeptidases (EC 3.4.16.5 and EC 3.4.16.6). The catalytic activity of the serine
carboxypeptidases, like that of the trypsin family serine proteases, is provided by a
charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine,
which is itself hydrogen-bonded to a serine (Liao D.I., Remington S.J., J. Biol. Chem.
265:6528-6531 (1990)).

The sequences surrounding the active site serine and histidine residues
are highly conserved in all these serine carboxypeptidases. A consensus for this protein
family is: [LIVM]-x-[GTA]-E-S-Y-[AG]-[GS] [S is the active site residue]. A second
consensus for this protein family is: [LIVF]-x(2)-[LIVSTA]-x-[IVPST]-x-[GSDNQL]-
[SAGV]-[SG]-H-x-[IVAQ]-P-x(3)-[PSA] [H is the active site residue].

dsrm Double-Stranded RNA Binding Motif. SEQ ID NO:1818
represents a polynucleotide encoding a novel member of the dsrm double-stranded
RNA binding motif proteins. In eukaryotic cells, a multitude of RNA-binding proteins
play key roles in the posttranscriptional regulation of gene expression. Characterization
of these proteins has led to the identification of several RNA-binding motifs. Several
human and other vertebrate genetic disorders are caused by aberrant expression of
RNA-binding proteins (C. G. Burd & G. Dreyfuss, Science 265:615-621 (1994)).

Proteins containing double stranded RNA binding motifs bind to specific
RNA targets. Double stranded RNA binding motifs are exemplified by interferon-
induced protein kinase in humans, which is part of the cellular response to dsRNA.

SEQ ID NOs:2577, 3183 and 3195 encode members of the 4 trans-
membrane integral membrane protein family. This family consists of type III proteins,
which are integral membrane proteins that contain an N-terminal membrane-anchoring
domain that is not cleaved during biosynthesis, and which functions as a translocation
signal and a membrane anchor. The proteins also have three additional transmembrane
regions. The consensus pattern is: G-x(3)-[LIVMF]-x(2)-[GSA]-[LIVMF](2)-G-C-x-
[GA]-[STA]-x(2)-[EG]-x(2)-[CWN]-[LIVM](2).

SEQ ID NO:2944 encodes a polypeptide having a calpain large subunit,
domain III. Calpains are a family of intracellular proteases that play a variety of
biological roles. Calpain 3, also known as p94, is predominantly expressed in skeletal
muscle and plays a role in limb-girdle muscular dystrophy type 2A (Sorimachi, H. et
al., Biochem. J. 328:721-732, 1997).

SEQ ID NOs:1911 and 1980 encode polypeptides having a C3HC4 type
zinc finger domain (RING finger), which is a cysteine-rich domain of 40 to 60 residues
that binds two atoms of zinc, and is believed to be involved in mediating protein-protein
interactions. Mammalian proteins of this family include V(D)J recombination
activating protein, which activates the rearrangement of immunoglobulin and T-cell
receptor genes; breast cancer type 1 susceptibility protein (BRCA1); bmi-1 proto-
oncogene; cbl proto-oncogene; and mel-18 protein, which is expressed in a variety of
tumor cells and is a transcriptional repressor that recognizes and binds a specific DNA
sequence. The consensus pattern is: C-x-H-x-[LIVMFY]-C-x(2)-C-[LIVMYA].

SEQ ID NO:3274 encodes a eukaryotic transcription factor with a fork
head domain of about 100 amino acid residues. Proteins of this group are transcription
factors, including mammalian transcription factors HNF-3-alpha, -beta, and -gamma;
interleukin-enhancer binding factor; and HTLF, which binds to a region of human T-
cell leukemia virus long terminal repeat. The consensus pattern is: [KR]-P-[PTQ]-
[FYLVQH]-S-[FY]-x(2)-[LIVM]-x(3,4)-[AC]-[LIM].

SEQ ID NO:3345 encodes a polypeptide having a PDZ domain. Several
dozen signaling proteins belong to this group of proteins that have 80-100 residue
repeats known as PDZ domains. Several of the proteins interact with the C-terminal
tetrapeptide motifs X-Ser/Thr-X-Val-COO- of ion channels and/or receptors (Ponting,
C. P., Protein Sci. 6:464-468, 1997).
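
The C-terminal tetrapeptide motif mentioned above differs from the other patterns
in this section in that it is anchored to the free carboxyl end of the partner
protein, so only the last four residues matter. A minimal sketch of such a check
follows; the function name and the test sequences are hypothetical.

```python
def has_pdz_binding_tail(sequence):
    """Check the last four residues against X-Ser/Thr-X-Val-COO-.

    The first and third positions (X) may be any residue, the second must be
    Ser or Thr, and the final residue, which carries the free carboxylate,
    must be Val.
    """
    tail = sequence[-4:]
    return len(tail) == 4 and tail[1] in "ST" and tail[3] == "V"


# Made-up sequences used only to exercise the code.
print(has_pdz_binding_tail("KLSSIESDV"))  # True  (tail E-S-D-V)
print(has_pdz_binding_tail("KLSSIESDL"))  # False (last residue is not Val)
```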

SEQ ID NO:3351 encodes a polypeptide in the family of phorbol
esters/diacylglycerol binding proteins. Phorbol esters (PE) are analogues of diacylglycerol
(DAG) and potent tumor promoters. DAG activates a family of serine-threonine protein
kinases, known as protein kinase C. The N-terminal region of protein kinase C binds
PE and DAG, and contains one or two copies of a cysteine-rich domain of about 50
amino acid residues. Other proteins having this domain include diacylglycerol kinase;
the vav oncogene; and N-chimaerin, a brain-specific protein. The DAG/PE binding
domain binds two zinc ions through the six cysteines and two histidines that are
conserved in the domain. The consensus pattern is: H-x-[LIVMFYW]-x(8,11)-C-x(2)-
C-x(3)-[LIVMFC]-x(5,10)-C-x(2)-C-x(4)-[HD]-x(2)-C-x(5,9)-C.

SEQ ID NO:2216 encodes a polypeptide having a WW/rsp5/WWP
domain. The domain is named for the presence of conserved aromatic positions,
generally tryptophan, as well as a conserved proline. Proteins having the domain
include dystrophin, vertebrate YAP protein, and IQGAP, a human GTPase activating
protein which acts on ras. The consensus pattern is: W-x(9,11)-[VFY]-[FYW]-x(6,7)-
[GSTNE]-[GSTQCR]-[FYW]-x(2)-P.

10 SEQ ID NO:2428 encodes a member of the dual specificity phosphatase 

family, having a catalytic domain, and SEQ IDS NOs:2281 and 2310 encode members 
of the protein tyrosine phosphatase family. These families are related and classified as 
tyrosine specific protein phosphatases. The enzymes catalyze the removal of a 
phosphate group from a tyrosine residue, and are important in the control of cell growth, 

15 proliferation, differentiation, and transformation. The consensus pattern is [LIVMF]-H- 
C-x(2)-G-x-(3)-[STC]-[STAGP]-x-[LIVMFY]. 
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Table 1 



SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY. 


1 


377044 


RTA00002676F.p.l l.2.P.Seq 


F 


M00039329A.COI 


CH09LNL 


2 


377708 


RTA00002633 F.m.0 1 .2. P.Seq 


F 


M00040039A:G08 


CH09LNL 


.> 


427782 


RT A 00002666 F . 1.06 . t . P . Seq 


F 


M00032638D:A06 


CHOSLNH 


4 


29372 


RTA000027 12F.a.06. 1 .P.Seq 


F 


M00023282A:C02 


CH04MAL 


5 


455003 


RTA00002694F.b.02. LP.Seq 


F 


M00043419D:A10 


CH20COHLV 


6 


380625 


RTAOOO02634F.d.03.2.P.Seq 


F 


M00040113D:G10 


CH09LNL 


7 


450959 


RTA0000269! F.b.05.3. P.Seq 


F 


M00043306D:B07 


CHI7COHLV 


3 


397351 


RTA0O0026S0F.b.04. LP.Seq 


F 


M00039775A:A09 


CH09LNL 


9 


20652 


RTA000027i0F.k.0L LP.Seq 


F 


M00022440B:E01 


C HO 3 VIA H 


10 


97330 


RTA00002663F.k. IS. LP.Seq 


F 


M00022767B:GI 1 


CH05MAH 


i: 


373071 


RTA00002670 Fj.23 . 1 .P.Seq 


F 


M00033442A:D06 


CH09LNL 


12 


162369 


RTA000027 1 3 F.e.OL LP.Seq 


F 


M00027292D:F10 


CH04MAL 


13 


401247 


RTA00002685F.L 15.2. P.Seq 


F 


M0003950SA:C!2 


CHI2EDT 


14 


430738 


RTA00002669F.i. 1 5.3-. P.Seq 


F 


M00033251D:BO9 


CHOSLNH 


— — 

15 


46/79 


RTA000027 11 F.c. 14. LP.Seq 


F 


M00022360C;GO4 


CH03MAH 


16 


375772 


RTA000O268 1 F.p.O L2.P.Seq 


F 


M000399O^C.GO5 


CH09LNL 


17 


4306S9 


RTA00002669F.j.0 1 .3. P.Seq 


F 


M00035243BA05 


CHOSLNH 


IS 


376546 


RTA000026"7F.d.07.2.P.Seq 


F 


M0003934fC:CI2 


CH09LNL 


19 


430041 


RTA00002667F.L 1 7. 1 P.Seq 


F 


M00032790B:A07 


CHOSLNH 


20 


431643 


RTA00002669F.L 1 6. 1 .P.Seq 


F 


M000332"6D:H09 


CHOSLNH 


21 


19422 


RTA000O2709F .c.02. LP.Seq ]_ F 


M00005449B:B10 


CH02COH 


22 


376302 


RTA00002677F.C. ! 3.2. P Seq | F 


M00039344B:GO7 


CH09LNL 


23 


376314 


RTA000026MF h.02. 1 .P.Seq j F 


M00039l39C:Gt2 


CHO c LNL 


24 


375492 


RTAG00G26"F.m. 19.2. P.Seq 


F 


M0003941S3.DOS 


CH09LNL 


25 


379114 


RTA0000268 1 F.n.24.2.P.Seq 


F 


M00039903CF03 


CHO°LNL 


26 


330663 


RTA00002670F.pJ L LP.Seq 


F 


MO003353iC:HIO 


CKO^LNL 




21581 7 


RTA00002664F i.l 9.2. P.Seq 


F 


MO002763-tA:Di ! 


CH04MAL 


23 


375749 


RTA0O0O263OF.r.23. LP.Seq 


F 


M00059795D:G06 


CHO^LNL 


29 


430396 


RTAO00O2669F.b.20.4.P.Seq 


F 


MO0O33!S5C D01 


CHOSLNH 


30 


380462 


RTA00002670F.O.0 1 . 1 .P.Seq 


F 


MO00535"OB:EO6 


CHOOLNL 


31 


430396 


RTA00002669F.b.20.3.P.Seq 


F 


MO0033IS5C:DO! 


CHOSLNH 


32 


376996 


RTA00002676F.p.l3.2.P.Seq 


F 


MO0O39329C:BIO 


CHO°LNL 


■ ■ — 

.>.? 


374846 


RTA00002b77F.k. 1 9.2.P.Seq 


F 


M00039412D:G06 


CHOOLNL 


34 


379075 


RTA00002672F.n.t3.2.P.Seq 


F 


M0003903°B:E03 


CHO^LNL 


35 


374172 


RTAC0002673F.k. 16.2. P.Seq 


F 


M000390sTD:D06 


CHOOLNL 


36 


373104 


RTA000026S5F.O. 1 5.2. P.Seq 


F 


M000400^SD:GI2 


CHOOLNL 




186302 


RTA000027 1 }F.m.2 1 . 1 P.Seq 


F 


M000Z75^1B C04 


CHO-LMAL 


33 


427947 


RTA00002t>65F.o.0I . LP.Seq 


F 


M000324O5B:D02 


CHOSLNH 


39 


375130 


RTAOOG02675F.d. 1 7. LP.Seq 


F 


M0003OOc-D:HO9 


CHOOLNL 


40 


377584 


RTA00002633F. 1.22.2. P.Seq 


F 


M000400SSC.EI0 


CHOOLNL 


41 


377364 


RTA00002o r SF.a. 15. 2. P. Seq 


F 


M0003^43:C AO! 


CHOOLNL 


42 


37634" 


RTAOOOO:6"5F.1.0S. i .P.Seq 


F 


M0003 c >2-i o C:Gl 1 


CHOOLNL 


43 | 446 T4 7 


RTA0000268°F.d. 16. 2. P.Seq 


F 


M00042740A:E09 


CHI5C0N 


44 j 28092 


RTA00002"! IF. l>. 12. LP.Seq \ F 


M0C0230J2A:B05 


CH03MAH 


45 


378206 


RTAOO';D26:iF.j.20.3 P-Seq F 


M000335SSC:G04 


CHO°LNL 


46 


378200 


RfA00002o"IF.j.20.2. P.Seq | F 


M000335SSCG04 


CHOOLNL 


4" 


14940 


RTAOO0t:2"O9F.J.I LI P.Seql F MCO0056:3 A.G02 t CH02COH 
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SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


43 


37341 1 


RTA00002672F.g.l3.2.P.Seq 


F 


M00059004B:A06 


CH09LNL 


49 


38120 


RTA00002712F.i.14.1.P.Seq


F


M00026927D:F02


CH04MAL 


50 


375730 


RTA00002678F.U5.2. P.Seq 


F 


M00059612B:G05 


CH09LNL 


51 


423959 


RTA00002667F.h.15.1.P.Seq


F 


M0005281 IB:D02 


CH08LNH 


52 


376851 


RTA00002677F.c.05.2.P.Seq 


F 


M00059541C:H11


CH09LNL 


53 


373808 


RTA00002671F.d.14.2.P.Seq


F


M00058272A:G01


CH09LNL 


54 


376163 


RTA00002675F.n.17.1.P.Seq


F


M00039258B:E06


CH09LNL 


55 


13653 


RTA00002712F.o.08.1.P.Seq


F


M00027155A:B11


CH04MAL


56 


137632 


RTA00002664F.i.15.1.P.Seq


F 


M00027617B:C12 


CH04MAL 


57 


574122 


RTA00002675F.L22.LP.Seq 


F 


M00039104D:C09 


CH09LNL 


58 


374946 


RTA00002675F.J.24. LP.Seq 


F 


M00059096A:E07 


CH09LNL 


59 


375666 


RTA00002677F.nJ6.2.P.Seq 


F 


M00059422D:F04 


CH09LNL 


60 


!6°369 


RTA000027 1 5F.d.24. LP.Seq 


F 


M00027292D:F10 


CH04MAL 


61 


21480 


RTA00002709F.c.13.2.P.Seq


F


M00005551D:F06


CH02COH 


62 


18560 


RTA00002711F.e.20.1.P.Seq


F


M00022958B:F07


CH03MAH


63 


96575 


RTA00002665F.j.08. 1 P.Seq 


F 


M00022641C:H05


CH03MAH


64 


377576 


RTA00002682F.f.18.1.P.Seq


F 


M00059975C:C11 


CH09LNL 


65 


446747 


RTA00002689F.d.16.5.P.Seq


F 


M00042740A:E09 


CH15CON 


66 


379311


RTA00002682F.g.01.1.P.Seq


F 


M00059976D:A12 


CH09LNL 


67 


379311 


RTA00002682F.f.24.1.P.Seq


F


M00059976D:A12 


CH09LNL 


68


124549


RTA00002713F.c.07.1.P.Seq


F


M00027257C:B08 


CH04MAL 


69 


449785 


RTA00002691F.c.07.5.P.Seq


F


M00043345OA06


CH17COHLV


70


375134


RTA00002675F.k.22.2.P.Seq


F


M00059099A:H03 


CH09LNL 


71 


186593 


RTA00002715F.n.15.1.P.Seq


F


M00027620D:F11


CH04MAL 


72 


449831 


RTA00002691F.a.17.5.P.Seq


F


M0004251SD:A06 


CH17COHLV 


73 


379678 


RTA00002676F.b.06.1.P.Seq


F


M00059274B:G07


CH09LNL 


74 


20599 


RTA00002708F.h.06.1.P.Seq


F


M00004264B:A05 


CH01COH 


75 


41115 


RTA000027l5F.o lLLP.Seq | F 


M00027652B:Fl i 


CH04MAL 


76 


21109


RTA00002708F.h.12.1.P.Seq


F 


M00004273A:F09 


CH01COH 


77 


455702 


RTA00002694F.b.11.1.P.Seq


F


M00043455C:G07


CH20COHLV 


78 


380643 


RTA00002635F.p.09.2. P.Seq 


F 


M00040105B:H10


CH09LNL 


79 


374413 


RTA00002672F.L 15. 2, P.Seq 


F 


M00059015B:G10 


CH09LNL


80 


378891 


RTA00002672F.i.18.2.P.Seq


F


M00039016A:A02


CH09LNL 


81


379574 


RTA00002672F.k.11.2.P.Seq


F


M00059028C:B11


CH09LNL


82 


17253 


RTA00002709F.h.25. 1 .P.Seq 


F 


M00006866A:D07


CH02COH 


83


21565 


RTA00002709F.e. Ill .P.Seq 


F 


M00005778B:F09 


CH02COH 


84 


375996 


RTA000O2675F.n. Ill .P.Seq 


F 


M00039I0SD:B06 


CH09LNL


85 


380437 


RTA00002683F.c.09.1.P.Seq


F 


M00040039D:D06 


CH09LNL 


86 


450729 


RTA00002669F.h.18.2.P.Seq


F


M00035226A:A11


CH08LNH


87 


376791 


RTA00002674F 1.17. LP.Seq 


F 


M00039166B:G06 


CH09LNL 


88


375760 


RTA00002672F.p.20.1.P.Seq


F 


M00059049D:G07 


CH09LNL 


89 


573357 


RTA00002672F.p.22.1.P.Seq


F


M00059050A:H10


CH09LNL 


90 


576455 


RTA00002678F.h.17.2.P.Seq


F


M00059476B:A02


CH09LNL 


91 


575881 


RTA00002672F.b.20.1.P.Seq


F 


M00058638D:H05 


CH09LNL 


92 


577086 


RTA00002676F.p.07.1.P.Seq


F


M0005952SD:D07


CH09LNL


93 


577889 


■RTA00002672FX.08. LP.Seq 


F 


M00053661A:A07 


CH09LNL 


94 


380442


RTA00002684F.b.05.2.P.Seq


F


M00040111C:D05


CH09LNL





95 


374689 


RTA000026 / 6F. m. 1 ->.2.P.beq 


F 


M00039jI8B:B09 


CH09LNL 


96 


375339 


RTA000026/8F.m.2.>-2. P.Seq 


F 


M000j9616A:B10 


CH09LNL


97 


14197 


RTA000027 1 OF.f. 1 j. 1 P.Seq 


F 


M00022084D:B0I 


CH03MAH


98 


380666 


RTA00002684F.c.04.2.P.Seq


F


M00040115B:H12


CH09LNL 


99 


377352 


RTA000026/7F.i.lj.2.P.Seq 


F 


M000j9404B:A03 


CH09LNL 


100 


379188 


RTA00002682F.a.0j. 1, P.Seq 


F 


MO00j9914D:G 12 


CH09LNL 


101 


428269 


RTA00002666F.C. 1 j. 1 .P.Seq 


F 


M000j25j9B:C1 I 


CH08LNH 


102 


373464 


RTA00002671F.l.13.3.P.Seq


F


M00038327A:C11


CH09LNL 


103 


15527 


RTA00002710F.p.07.1.P.Seq


F 


M0002274 / D: E0j 


CH03MAH


104 


377504 


RTA00002671F.i.17.3.P.Seq


F 


M000j8j0jC:D02 


CH09LNL 


105 


33508 


RTA00002710F.g.17.1.P.Seq


F


M0002218jB:C02


CH03MAH


106 


129179 


RTA00002662F.d.19.2.P.Seq


F 


M000071d^C:F11 


CH02COH 


107 


377086 


RTA00002676F.p.07.2.P.Seq 


F 


M00039328D:D07 


CH09LNL 


108 


375872 


RTA00002675F.h.15.1.P.Seq


F 


M00039233A:A0j 


CH09LNL 


109 


375652 


RTAGG002676F.L07.3. P.Seq 


F 


M000j93OjC.FU 


CH09LNL 


110 


374266 


RTA00002674FJ.08.2. P.Seq 


F 


M00039144C:E06


CH09LNL 


111


378983 


RTA00002682F.a.07.1.P.Seq


F


M00039915D:C11


CH09LNL 


112 


377343 


RTA00002684F.g.04.1.P.Seq


F


M00040302C:A04


CH09LNL 


113 


378679 


RTA00002681F.f.16.2.P.Seq


F 


M00039869B:F06 


CH09LNL 


114 


374095 


RTA00002671F.p.08.2.P.Seq


F


M00038618C:C08


CH09LNL 


115


375843


RTA00002671F.o.06.2.P.Seq


F


M00038614C:H11


CH09LNL


116


377788


RTA00002684F.h.01.2.P.Seq


F 


M00040305C:H06 


CH09LNL 


117 


21403 


RTA00002709F.j.05.1.P.Seq


F 


M00006928D:D07 


CH02COH 


118 


23184 


RTA00002709F.b.05.2.P.Seq 


F 


M00005358B:B06


CH02COH 


119 


15671 


RTA00002710F.k.16.1.P.Seq


F 


M00022495D:H08 


CH03MAH 


120 


177367 


RTA00002663F.m.22.1.P.Seq


F 


M00022936D:H09 


CH03MAH 


121 


377788


RTA00002684F.g.24.1.P.Seq


F


M00040305C:H06


CH09LNL 


122 


375058 


RTA00002675F.h.02.1.P.Seq


F


M000j>92j0D:G12


CH09LNL


123 


380412 


RTA00002680F.k.15.2.P.Seq


F


M00039816B:D04


CH09LNL


124 


178447 


RTA00002663F.n.06.1.P.Seq


F 


M0002j>00.*A:H04 


CH03MAH


125 


376647 


RTA00002674F.h.07.1.P.Seq


F


M00039140D:D09


CH09LNL 


126 


44679 


RTA00002661F.e.19.1.P.Seq


F


M00003800A:F09


CH01COH


127 


377659 


RTA00002678F.a.04.2.P.Seq


F


M000^°4j0B:FI2


CH09LNL


128 


379703 


RTA00002682F.h.03.1.P.Seq


F 


M000^9982C:H04 


CH09LNL 


129 


374673 


RTA00002673F.e.08.2.P.Seq


F 


M000j906SB:B04 


CH09LNL


130 


20513 


RTA00002710F.j.12.1.P.Seq


F


M00022j91D:F10


CH03MAH


131 


376124 


RTA00002682F.n.17.1.P.Seq


F


M00040021A:F09


CH09LNL 


132 


374679 


RTA00002676F.d.0/.2.P.Seq 


F 


M000j928 ID:B04 


CH09LNL


133


23184


RTA00002709F.b.05.1.P.Seq


F


M00005358B:B06


CH02COH


134 


430953 


RTA00002668F.i.23.1.P.Seq


F


M00033007C:E01


CH08LNH


135 


380442 


RTA00002684F.b.05.1.P.Seq


F


M00040111C:D05


CH09LNL


136 


12374 


RTA00002709F.a.01.1.P.Seq


F 


M00004S25D:D05 


CH02COH 


137 


427466 


RTA00002665F.b.11.3.P.Seq


F


M00028184D:G10


CH08LNH


138 


36611 


RTA0000266SF.f.03.1. P.Seq 


F 


M00032942D:C12 


CH08LNH 


139 


33756 


RTA00002662F.a.18.2.P.Seq


F 


M00005359A:D04 


CH02COH 


140 


456026 


RTA00002694F.e.03.1.P.Seq


F


M00043616C:A05


CH20COHLV


141 


15766 


RTA00002710F.k.02.1.P.Seq


F


M00022444D:G01


CH03MAH





142 


24j^2 


RTA00002 /OVr.a.O?. 1 .P.Seq 


F 


M000043j9C:H02 


CH02COH 


143


24j54 


RTA00002709r.a.O^. I .P.Seq 


F 


M000048j2D:H02 


CH02COH 


144 


379114


RTA00002681F.o.01.2.P.Seq


F 


MO00j99OjC:F0j 


CH09LNL 


145


19609 


RTA00002709F.c.05.1.P.Seq


F


M00005457C:A03


CH02COH 


146 


21635 


RTA00002709F.e.2j. 1 .P.Seq 


F 


M00006d81 D:F08 


CH02COH 


147 


330085 


RTA00002682F.1. 10. 1 .P.Seq 


F 


M000j998; A:F09 


CH09LNL 


148


20700 


RTA00002710F.i.18.1.P.Seq


F 


M000223 / jA.BOd 


CH03MAH 


149 


j 79981 


RTA00002682F.i.18.1.P.Seq


F 


M000j9988A:E06 


CH09LNL 


150


j76d91 


RTA0000267:>F.c.O 1 . 1 .P.Seq 


F 


M0003921 jA:D01 


CH09LNL 


151


92058 


RTA0000266->F.m.04. 1 P.Seq 


F 


M0002289^A;H08 


CH03MAH 


152


196936 


RTA0000266jF.rn.02. 1 P.Seq 


F 


M0002288fC:H05 


CH03MAH 


153


430702 


RTA00002668F.h.04.1.P.Seq


F


M00032990B:A11


CH08LNH 


154


373448


RTA00002680F.n.21.2.P.Seq


F 


M00039832A:B12 


CH09LNL 


155 


41606 


RTA00002713F.e.10.1.P.Seq


F


M00027301A:G05


CH04MAL 


156


213817


RTA00002664F.i.19.1.P.Seq


F 


M0O02763-iA:Dll 


CH04MAL 


157 


373464 


RTA 0000^671 F.I. 13.1. P.Seq 


F 


M00038327A:C11 


CH09LNL 


158


379433


RTA00002679F.k.12.1.P.Seq


F 


M00039700B:D02 


CH09LNL 


159 


375796 


RTA00002680F.f. 1 1 .P.Seq 


F 


M00039795B:H10


CH09LNL 


160 


375796 


RTA00002680FJ.1 :.2.P.Seq 


F 


M00039795B:H10


CH09LNL 


161 


120485 


RTA00002663F.b.12.1.P.Seq


F 


M0002I66?B:FI2 


CH03MAH 


162 


374291 


RTA00002673F.f.17.1.P.Seq


F 


M00039072C:E02 


CH09LNL 


163 


380513 


RTA00002677F.p.15.2.P.Seq


F 


M0003942SC:E01 


CH09LNL 


164 


379416 


RTA00002683F.J.0. .2. P.Seq 


F 


M0004007'D:C1 1 


CH09LNL 


165 


378178 


RTA00002680F.1. 1 +. 1 .P.Seq 


F 


M00039820A:F11


CH09LNL 


166 


427947 


RTA0000266?F.n.24. 1 .P.Seq 


F 


M00032495B:D02 


CH08LNH


167 


427269 


RTA00002665F.d.03.3.P.Seq


F 


M000232I2C:B08 


CH08LNH 


168 


20451 


RTA00002710F.j.10.1.P.Seq


F 


M0002239!B:£01 


CH03MAH 


169 


377003 


RTA0000263jF.g.0^.2.P.beq 


F 


M00040062B:B05


CH09LNL 


170 


427759 


RTA0000266^F.o.l Ll.P.beq 


F 


M00032499C:A01


CH08LNH


171 


427549 


RTA00002668F.k. 1 1 .P.Seq 


F 


M00033034C:A06 


CH08LNH


172 


373881 


RTA00002672F.b.20.2.P.Seq


F 


M00033633D.H03 


CH09LNL 


173


133215 


RTA00002664F.f. 1 j. 2. P.Seq 


F 


M00027200A:F02 


CH04MAL 


174 


379683 


RTA00002681F.d.04.2.P.Seq


F


M00039357B:G10


CH09LNL 


175


380652 


RTA00002678F.d.12.2.P.Seq


F 


M00039455D:H04 


CH09LNL 


176 


378334 


RTA00002679F.h.10.1.P.Seq


F


M00039632C:H11


CH09LNL 


177 


j77930 


RTA00002680F.g.14.1.P.Seq


F 


M00039/98S:B02 


CH09LNL 


178


378692 


RTA00002680F.0. 20. _*. P.Seq 


F 


M00039835A:F07 


CH09LNL 


179 


32279 


RTA00002709F.d.2_*. I .P.Seq 


F 


M0000^6/^B:B12 


CH02COH 


180


j 76^79 


K 1 AUUUUJodUr c. 1 ?. 1 .r.beq 


F 


M000j978_ A.H 10 


CH09LNL 


181 


375963 


RTA00002675F.L 12.1. P.Seq 


F 


M00039233A:B12


CH09LNL 


182 


378683 


RTA00002680F.a.14.2.P.Seq


F 


M00039773D:A09 


CH09LNL 


183 


374946 


RTA00002673F.j.24.2.P.Seq


F 


M00039096A:E07 


CH09LNL 


184 


429533 


RTA00002666F.g.10.1.P.Seq


F


M00032534A:H03


CH08LNH


185 


23338 


RTA0000271 1 F.e. 1". 1 P.Seq 


F 


M00022930C:E02


CH03MAH 


186 


427970 


RTA00002665F.j.13.1.P.Seq


F


M00031363A:E10


CH08LNH


187 


379650 


RTA00002683F.h.22.2.P.Seq


F 


M00040072C:G09 


CH09LNL 


188


379661 


RTA00002676F.C.05.:. P.Seq 


F 


M00039277D:G10 


CH09LNL


189 


376182 


RTA00002677F.b.17.2.P.Seq


F


M00039340B:E07


CH09LNL 


190 


374797 


RTA00002678F.b.12.2.P.Seq


F 


M00039444C:H02 


CH09LNL 


191 


375389 


RTA00002674F.a.13.1.P.Seq


F


M00039120C:C09


CH09LNL 


192 


397115 


RTA000026S3F.i.~>2.2.P.Seq 


F 


M00040076C:D06 


CH09LNL 


193 


186655


RTA000027I2F.L2 1.1. P.Seq 


F 


M00026941D:A04 


CH04MAL 


194 


404682 


RTA00002687F.b.13.1.P.Seq


F


M00039766D:H01


CH14EDT 


195 


19609 


RTA00002709F.c.05.2.P.Seq


F


M00005457C:A03


CH02COH 


196 


404682 


RTA00002687F.b.l3.2.P.Seq 


F 


M00039766D:H01 


CH14EDT


197 


380412 


RTA00002680F.k.15.1.P.Seq


F 


M00039816B:D04 


CH09LNL 


198 


394413


RTA00002639F.d. 17.3. P.Seq 


F 


M00042742D:D05 


CH15CON


199 


380086 


RTA00002679F.m.16.1.P.Seq


F 


M00039710OG03 


CH09LNL 


200 


430738 


RTA00002669F.i.15.2.P.Seq


F 


M00033231D:B09 


CH08LNH 


201 


40667 


RTA00002712F.g.22.1.P.Seq


F 


M00026882D:G09 


CH04MAL 


202 


397421


RTA0000268 1 Fx. 1 6.2.P.Seq 


F 


M00039854B:F09 


CH09LNL 


203 


398775 


RTA00002679F.f.11.1.P.Seq


F 


M00039675D:H05 


CH09LNL 


204 


87345 


RTA00002712F.f.19.1.P.Seq


F 


M00026850D:F09 


CH04MAL 


205 


379920 


RTA00002679F.b.24.2.P.Seq


F 


M00039660C:C10 


CH09LNL 


206 


380666 


RTA00002684F.c.04.1.P.Seq


F


M00040115B:H12


CH09LNL 


207 


404340 


RTA00002687F.b.05.2.P.Seq 


F 


M00039764C:D07


CH14EDT 


208 


375509 


RTA00002680F.e.08.2.P.Seq


F 


M00039790B:D03 


CH09LNL 


209 


46423 


RTA000027 1 2F.L02. 1 .P.Seq 


F 


M00026914A:H10 


CH04MAL 


210 


401713 


RTA00002685F.p.10.2.P.Seq


F


M00039647A:H11


CH12EDT 


211 


377003 


RTA00002683F.n.09.1.P.Seq


F


M00040062B:B05


CH09LNL 


212 


378891 


RTA00002672F.i.18.1.P.Seq


F


M00039016A:A02


CH09LNL 


213 


412778 


RTA00002685F.i.07.2.P.Seq 


F 


M00039533D:F04 


CH12EDT 


214 


373786 


RTA00002679F.a.20.2.P.Seq


F 


M00039655C:C07 


CH09LNL 


215 


378692 


RTA00002680F.o.20.2.P.Seq


F 


M00039835A:F07 


CH09LNL 


216 


88888 


RTA00002713F.f.22.1.P.Seq


F 


M00027355A:B07 


CH04MAL 


217 


358187 


RTA00002676F.b.04.2.P.Seq 


F 


M00039273D:B02 


CH09LNL 


218 


377131 


RTA00002632F.e. 10.1. P.Seq 


F 


M00039938C:E11


CH09LNL 


219 


21488 


RTA00002708F.f.17.1.P.Seq


F 


M00004152A:C12 


CH01COH 


220 


447487 


RTA00002689F.e.04.3.P.Seq


F 


M00042895A:D10 


CH15CON 


221 


364 


RTA00002710F.a.06.1.P.Seq


F


M00007929C:B03


CH03MAH 


222 


404024 


RTA00002687F.e.18.2.P.Seq


F


M00039958A:A03


CH14EDT


223 


152305 


RTA00002712F.d.02.1.P.Seq


F


M00023376B:G04


CH04MAL


224 


106050 


RTA00002713F.o.17.1.P.Seq


F 


M00027668C:H12 


CH04MAL 


225 


41126 


RTA00002713F.l.12.1.P.Seq


F


M00027546C:B10


CH04MAL 


226 


113496


RTA00002713F.n.20.1.P.Seq


F 


M00027625A:H01 


CH04MAL 


227 


447487 


RTA00002689F.e.04.1.P.Seq


F


M00042895A:D10


CH15CON 


228 


146335 


RTA00002712F.j.17.1.P.Seq


F 


M00O269S0A:D09 


CH04MAL 


229 


376647 


RTA00002674F.h.07.2.P.Seq


F


M00039140D:D09


CH09LNL 


230 


376746 


RTA00002674F.f.12.2.P.Seq


F 


M00039133B:F08 


CH09LNL 


231 


373523 


RTA00002674F.n.21.2.P.Seq


F


M00039177B:D03


CH09LNL 


232 


455466 


RTA00002694F.c.10.1.P.Seq


F 


M00043461D:E06 


CH20COHLV 


233 


374031 


RTA000026S3F.p.. 1 7.2. P.Seq 


F 


M00040105C:FH 


CH09LNL 


234 


373997 


RTA00002673F.m.04.2.P.Seq


F 


MO0039l05C:B0S 


CH09LNL 


235 


455717 


RTA00002694F.a.06.1.P.Seq


F


M00042593C:G06


CH20COHLV 


236 


373837 


RTA00002672F.p.22.2.P.Seq 


F 


M00039050A:H10 


CH09LNL 


237 


374513 


RTA00002672F.L t6.2.P.Seq 


F 


M0003901fB:H09 


CH09LNL 


238 


375628 


RTA00002672F.k.04.2.P.Seq 


F 


M00039026D:F05 


CH09LNL 


239 


377732 


RTA00002681F.p-09.KP.Seq 


F 


M00039910C:G10


CH09LNL 


240 


378326 


RTA00002681F.m.11.1.P.Seq


F


M00039896C:H01


CH09LNL 


241


378001


RTA00002681F.m.22.1.P.Seq


F 


M00039898D:C06 


CH09LNL 


242 


378459 


RTA00002681F.i.07.2.P.Seq


F


M00039879D:B11


CH09LNL 


243 


373862 


RTA00002671F.g.01.2.P.Seq


F 


M00038284B:H04 


CH09LNL 


244 


373252 


RTA00002670F.k.16.1.P.Seq


F


M00033451A:H01


CH09LNL 


245 


378475 


RTA00002672F.g.24.1.P.Seq


F


M00039006D:B01


CH09LNL 


246 


379941 


RTA00002682F.j.15.1.P.Seq


F


M00039990C:D10


CH09LNL 


247 


427703 


RTA00002665F.e.11.1.P.Seq


F


M00028357A:G10


CH08LNH 


248 


373976 


RTA00002671F.p.15.2.P.Seq


F


M00038619B:A03


CH09LNL 


249 


431643 


RTA00002669F.l.16.3.P.Seq


F


M00033276D:H09


CH08LNH


250 


383502 


RTA00002670F.k.07.1.P.Seq


F 


M00033446D:B02 


CH09LNL 


251


378764


RTA00002681F.j.04.1.P.Seq


F


M00039884A:H11


CH09LNL 


252 


431629 


RTA00002669F.l.14.3.P.Seq


F 


M00033276B:G0S 


CH08LNH


253 


372992 


RTA00002671F.b.16.2.P.Seq


F


M00033594C:B03


CH09LNL 


254 


431601 


RTA00002669F.k.08.3.P.Seq


F


M00033263B:G04


CH08LNH


255 


21059 


RTA00002710F.c.05.1.P.Seq


F 


MOOOOS053A:FIO 


CH03MAH 


256 


430689 


RTA00002669F.i.24.J.P.Seq 


F 


M00033243B:A05 


CH08LNH


257 


131764 


RTA00002662F x. 1 4. 1 .P.Seq 


F 


M00006893C:E07


CH02COH 


258 


373300 


RTA00002674F.c.21.2.P.Seq


F


M00039126D:A08


CH09LNL 


259 


384601 


RTA00002670F.k.06.1.P.Seq


F


M00033446C:G08


CH09LNL


260 


375389 


RTA00002674F.a.13.2.P.Seq


F


M00039120C:C09


CH09LNL 


261 


15248 


RTA000027 10FT.23. LP.Seq 


F 


M00022127C:H03


CH03MAH 


262 


428134 


RTA00002666F.c.15.1.P.Seq


F


M00032540A:A09


CH08LNH


263 


374 1 


RTA00002672F.a.19.1.P.Seq


F 


M0003S633A:D07 


CH09LNL 


264 


136225 


RTA00002676F.n.02.2.P.Seq 


F 


M00039319C:A04


CH09LNL 


265 


401713 


RTA00002685F.p.10.1.P.Seq


F


M00039647A:H11


CH12EDT


266 


27104 


RTA00002661F.a.09.1.P.Seq


F


M00001363D:D09


CH01COH


267 


207466 


RTA00002664F.j.03.2.P.Seq 


F 


M00027733A:A02


CH04MAL 


268 


143045 


RTA00002663F.a.02.1.P.Seq


F


M00007941D:C09


CH03MAH 


269 


378830 


RTA00002675F.e.07.1.P.Seq


F 


M0003922LA:H03 


CH09LNL


270 


21731 


RTA00002709F.k.07.1.P.Seq


F 


M00O0TOl3A:D09 


CH02COH 


271 


428552 


RTA00002666F.c.16.1.P.Seq


F


M00032541D:H08


CH08LNH


272 


187632 


RTA00002664F.i.15.2.P.Seq


F


M00027617B:C12


CH04MAL 


273 


431053 


RTA00002668F.o.05.2.P.Seq


F


M00033130B:F06


CH08LNH


274 


188972 


RTA00002664F.d.20.1.P.Seq


F


M00027030C:H06


CH04MAL 


275 


430678 


RTA0000266SF.h. \ 2- LP.Seq 


F 


M00032994A:A08


CH08LNH


276 


374042 


RTA00002672F.a.08.1.P.Seq


F


M00038631C:B10


CH09LNL


277 


24332 


RTA00002709F.j.07.1.P.Seq


F


M00006935C:F06


CH02COH 


278 


376764 


RTA00002674F.r20. LP.Seq 


F 


M00039135D:F05


CH09LNL


279 


134338 


RTA00002662F.c.15.2.P.Seq


F


M00006897A:H02


CH01COH


280 


375541


RTA00002680F.d.21.2.P.Seq


F 


M000397S8A:E03 


CH09LNL


281 


228900 


RTA00002664F.e.08.2.P.Seq


F


M00027085C:E11


CH04MAL 


282 


58063 


RTA0000266 1 F.h. ! S. LP.Seq 


F 


M00004234A:E07 


CH01COH


283 


380500 


RTA00002670F.p.19.2.P.Seq


F


M00053585B:E06


CH09LNL 


284 


34928 


RTA00002710F.p.21.1.P.Seq


F


M00022795B:G06


CH03MAH


285 


374028 


RTA00002674F.k.03.2.P.Seq 


F 


M00059156A:B11


CH09LNL 


286 


374121 


RTA00002672F.h.22.2.P.Seq


F 


M00059015A:C09 


CH09LNL 


287 


429547 


RTA00002668F.c.07.1.P.Seq


F 


M000529J7D:G09 


CH08LNH 


288 


380668 


RTA00002670F.p.11.2.P.Seq


F


M00033581C:H10


CH09LNL 


289 


258704 


RTA00002665F.m.06.1.P.Seq


F


M00032480B:E10


CH08LNH 


290 


380325 


RTA00002670F.p.22.2.P.Seq 


F 


M00055583D:B05 


CH09LNL 


291 


378326 


RTA00002681F.m.11.2.P.Seq


F 


M00039896C:H01 


CH09LNL 


292

375613 


RTA00002675F.d.13.1.P.Seq


F 


M000392!SA:F03 


CH09LNL 


293 


20999 


RTA00002709F.j.16.1.P.Seq


F


M00006977C:G04


CH02COH 


294 


29102 


RTA00002710F.p.18.1.P.Seq


F


M00022793D:B01


CH03MAH 


295 


379334 


RTA00002680F.b.22.1. P.Seq 


F 


M00039773C:A04 


CH09LNL 


296 


23943 


RTA00002709F.L 1 2. 1 .P.Seq 


F 


M00006886D:H02 


CH02COH 


297 


373998 


RTA00002672F.a.10.2.P.Seq


F


M00058631D:B02


CH09LNL 


298 


373325 


RTA00002672F.c.U.2.P.Seq 


F 


M00058662B:A12


CH09LNL 


299 


373818 


RTA00002672F.e.15.2.P.Seq


F 


M00058WC:G08 


CH09LNL 


300 


429843 


RTA00002668F.c.10.1.P.Seq


F


M00032913B:E06


CH08LNH


301 


427755 


RTA00002665F.d.19.3.P.Seq


F 


M00023516B:H12 


CH08LNH 


302 


189177 


RTA00002664F.c.23.2.P.Seq 


F 


M00026922C:G05 


CH04MAL


303 


13294


RTA00002709F.j.15.1.P.Seq


F


M00006968A:G08


CH02COH 


304 


178801 


RTA00002665F.n.01.1.P.Seq


F


M00022997A:F06


CH03MAH


305 


230865 


RTA00002664F.d.03.2.P.Seq 


F 


M0002692SD:A05 


CH04MAL 


306 


178801


RTA00002663F.m.24.1.P.Seq


F


M00022997A:F06


CH03MAH


307 


378809 


RTA00002672F.g.21.2.P.Seq


F 


M0003900fC:H01 


CH09LNL 


308 


378957 


RTA00002670F.d.17.2.P.Seq


F


M00055562C:C05


CH09LNL 


309 


373523 


RTA00002674F.n.21.1.P.Seq


F 


M00039177B:D05 


CH09LNL 


310 


375458 


RTA00002673F.l.06.2.P.Seq


F


M00059611D:D11


CH09LNL 


311 


429794 


RTA00002668F.c.09.1.P.Seq


F 


M0005291SB:D0S 


CH08LNH 


312 


72797 


RTA00002661F.e.07.1.P.Seq


F 


M00005761C:F02 


CH01COH 


313 


429992 


RTA00002668F.c.21.1.P.Seq


F 


M00052921B:H08 


CH08LNH 


314 


374410 


RTA00002674F.k.11.2.P.Seq


F 


M00039t58B:GI2 


CH09LNL 


315 


376553 


RTA00002674F.g.19.1.P.Seq


F 


M00059159A:C09 


CH09LNL 


316 


429096 


RTA00002666F.fi 6.1. P.Seq 


F 


M00052573A:G06 


CH08LNH


317 


181948 


RTA00002665F.n.05.1. P.Seq 


F 


MOOO250O3CD07 


CH03MAH 


318 


378475 


RTA00002672F.h.01.2.P.Seq


F 


M00059006D:B01 


CH09LNL 


319 


427336 


RTA00002665F.c.25.1.P.Seq


F


M00028210B:D02


CH08LNH


320 


374042 


RTA00002672F.a.08.2.P.Seq 


F 


M00058o5tC:B10 


CH09LNL 


321 


386543 


RTA00002672F.f.l5.2.P.Seq 


F 


M0005S^9B:GII 


CH09LNL 


322 


376659 


RTA00002678F.h.11.2.P.Seq


F 


M0005^475C:E10 


CH09LNL 


323 


29155 


RTA00002665F.c.09.1.P.Seq


F 


M00021^23C:DI 1 


CH03MAH 


324


377967


RTA00002681F.m.17.2.P.Seq


F 


M00059S^7D:CI0 


CH09LNL 


325 


431530 


RTA00002668F.m.16.2.P.Seq


F


M000550~4A:C08


CH08LNH


326 


373824 


RTA00002680F.i.19.2.P.Seq


F 


M0005^S03D:H02 


CH09LNL 


327 


50094 


RTA00002661F.j.02.2.P.Seq


F 


M000045*3A;3I.O 


CH01COH


328


214272 


KTA00002664F.h.0:v2.P.Seq 


F 


M00027566A:F11


CH04MAL 


329 


377295 


RTA00002630F.b. t T.2.P.Seq 


F 


M000;^"*7C:E05 


CH09LNL 





330 


195053 


RTA00002663F.n.16.1.P.Seq


F 


M000230J4B:D02 


CH03MAH 


331 


21274 


RTA00002709F.m.09.1.P.Seq


F


M00007194A:B09


CH02COH 


332 


376580 


RTA00002675F.b.20.1.P.Seq


F


M00039212C:C12


CH09LNL 


333

374725 


RTA00002673F.f.02.2.P.Seq 


F 


M00039070D:C02 


CH09LNL 


334 


25238 


RTA00002710F.n.08.1.P.Seq


F 


M00022634D:C08 


CH03MAH 


335 


377337 


RTA00002683F.l.07.2.P.Seq


F


M00040085D:A10


CH09LNL 


336 


450485 


RTA00002692F.a.13.2.P.Seq


F 


M00042625C:B04 


CHI SCON 


337 


21989 


RTA00002709F.h.22.1.P.Seq


F 


M00006861B:F09 


CH02COH 


338 


58296 


RTA00002661F.i.20.2.P.Seq


F


M00004354D:E05


CH01COH


339 


379144 


RTA00002679F.l.14.1.P.Seq


F 


M00039705D:F02 


CH09LNL 


340 


379690 


RTA00002680F.b.21.2.P.Seq


F 


M00039778B:G03 


CH09LNL 


341 


379640 


RTA00002681F.d.12.2.P.Seq


F 


M00039859C:G10 


CH09LNL 


342 


373988 


RTA00002673F.h.23.1. P.Seq 


F 


M00039079A:A05 


CH09LNL 


343 


373988 


RTA00002673F.h.23.2.P.Seq 


F 


M00039079A:A05 


CH09LNL 


344 


380673 


RTA00002673F.j.l3.2.P.Seq 


F 


M00039084C:H03 


CH09LNL 


345 


55243 


RTA00002661F.i.06.2.P.Seq


F


M000042S2D:C11


CH01COH


346 


40557 


RTA00002713F.h.21.1.P.Seq


F


M00027398C:F07


CH04MAL 


347 


375467 


RTA00002677F.m.03.1.P.Seq


F


M00039417A:D03


CH09LNL 


348 


398406 


RTA00002679F.j.02.1.P.Seq


F 


M00O:^689C:EO8 


CH09LNL 


349 


430392 


RTA00002668F.k.19.1.P.Seq


F


M00035037D:C11


CH08LNH


350 


376746 


RTA00002674F.f. 12.1. P.Seq 


F 


M00039133B:F08


CH09LNL 


351 


115595


RTA00002713F.e.07.1.P.Seq


F 


M00027297A:C04 


CH04MAL 


352 


377182 


RTA00002682F.l.11.1.P.Seq


F


M00040010A:F10


CH09LNL 


353 


380659 


RTA00002634F.e.07.2. P.Seq 


F 


MOOO^)I24D.HOI 


CH09LNL 


354 


373862 


RTA00002671F.g.01.1.P.Seq


F


M00038284B:H04


CH09LNL 


355 


376096 


RTA00002677F.b.16.2.P.Seq


F 


M00039340A:D05 


CH09LNL 


356 


372887 


RTA00002670F.d.05.2.P.Seq 


F 


M00033358A:H12


CH09LNL 


357 


378475 


RTA00002672F.g.24.2.P.Seq 


F 


M00039006D:B01


CH09LNL 


358 


427336 


RTA00002665F.c.23.3.P.Seq 


F 


M00028210B:D02


CH08LNH


359 


373814 


RTA00002672F.b.02.2.P.Seq 


F 


M00038635A:G09


CH09LNL 


360 


215506 


RTA00002664F.h.03.2.P.Seq 


F 


M00027438C:G07


CH04MAL 


361 


374465 


RTA00002673F.c.07.2.P.Seq


F 


M00039058C:H02 


CH09LNL 


362 


428784 


RTA00002667F.c.18.1.P.Seq


F


M00032744B:F10


CH08LNH


363 


379581 


RTA00002676F.a.21.2.P.Seq


F 


M0003P273B:F02 


CH09LNL 


364 


378371 


RTA00002678F.f.20.2.P.Seq


F 


M00039465A:A08 


CH09LNL 


365 


375154 


RTA00002676F.c.13.2.P.Seq


F 


M00039279B:H02 


CH09LNL 


366 


431214 


RTA00002669F.k.04.1.P.Seq


F


M00033262D:A11


CH08LNH


367 


376053 


RTA00002675F.l.03.1.P.Seq


F 


M00039249A:C12 


CH09LNL 


368 


373282 


RTA00002680FJ. 19.2. P.Seq 


F 


M00039S13B:DM 


CH09LNL 


369 


33397 


RTA00002661F.h.04.1.P.Seq


F


M00004168A:G11


CH01COH


370 


376706 


RTA00002675F.c.02.1.P.Seq


F 


M00039213B:F05 


CH09LNL 


371 


378292 


RTA00002681F.i.09.2.P.Seq


F


M00039880A:H11


CH09LNL 


372 


431612 


RTA00002669F.e.23.3.P.Seq


F


M0003_<202D:G06


CH08LNH


373 


378471 


RTA000026"9F.o. 1 7. 1 P.Seq 


F 


M0003 t: C2~C:309 


CH09LNL 


374 


378666 


RTA00002681F.i.05.2.P.Seq


F 


M00039S79C:F0; 


CH09LNL 


375 


374894 


RTA00002675F.f.04.1.P.Seq


F 


M00039Z24AEI2 


CH09LNL 


376 


430191 


RTA00002667F.j.24.1.P.Seq


F


M0003:829SE06


CH08LNH


117 


4i ojo 1 


RTA0O007AA7F r 17 1 p C Pfl 
i\ 1 nU'/uy-uo / r -C 1 1 .r\ocq 


c 
r 


\ A AAA"* 0*7"^ ft \ * » At 

»viUU0j2 / jvA. AOo 


/"*UAOi vtt f 

CH08LNH


378 


J ' 7J70 


RTA 00007A7QF L- 0 % I P *\^n 


c 

r 


\A AAA"" flAOTQ ■ C 1 1 

(ViUUUjVoV / o. r 1 i 


CH09LNL 


379 


*on 

■* J JUU 


RTA0nnfP7 1 OF t 77 1 P 

IV. I rtV'J'VU— / 1 vr J.— J. 1 .1 . OCq 


P 


ivlUuU224 j4 U. UUo 


if A ^ villi 

CH03MAH


380 

■J \J v/ 


73030 


RTA0000^709F h 10 1 P Spa 


F 
r 


MUUUU3 J04rA.V- 1 1 


LHU2LUH 


381 

JO 1 


"57007$ 


RTAOOOO^/wOF rt OA 1 P <s*«n 


F 
p 


MUUU :V /2UU. UU2 


PlfAAl Vlt 

CH09LNL 


J o*. 


d^n to 1 


pTA 00007 AA7F W 0 1 1 P <Z*rt 


F 
P 


!Vj<AAA 07Q D - CA£ 

IVIUUU j2o2V b. huo 


/™*T IA O I v[| 1 

CHOoLNH 


383 

JOJ 


J i 4UO*t 


RTAOOOO^fiTSF a 0"> 1 P 


F 
P 


IVIUUUjVJJo A.dUj 


PUAAI VII 


384 


*7^77R 
J / J / — 0 


RTA0000"*A7AF h OS 7 p C,»ri 


F 

r 


M0U0jy2VVo.U 1 2 


\ IA A 1 V 1 1 

CH09LNL 


385 




RTA00007A70F h OX 7 p Q Pn 


F 
P 


VvfAAA"* ?A£ n\. LI AO 

MUOu j j j L/o U . n U V 


/"» if AO 1 VII 


J OV-r 




RTA00007A7";F i 1 7 1 P ^r»ri 


F 

r 


\/IAAA A A O \ t — ■HA'* 

MU00jVUo4L . HOj 


LIAA T VII 

LH09LNL 


JO* 


J / 07 Jo 


RTA0O0O7A7QC U *>n 1 P 


F 

r 


\A AAA - A7A1 t.DII 

M0U0jv/02A.d 12 


plIAA T Vlt 

LH09LNL 


JOO 


77^ 1 1 > 


RTA00007A77 C 0 1 1 P 


c 
r 


V4AAA A AA// n.TAO 

M000j9066D-O0S 


T 1 A A T V 1 f 

CH09LNL 


JO" 


J / 00 / J 


DTA 00007ABnC n 7 1 7 D Q-in 
K 1 rtllUUU-OoUr .p.— 1 -^.r.ocq 


P 


MO00j98j8A:r03 


/"* T r A A r vtt 

CH09LNL 


7 on 


■2 77QAQ 


R 1 AUUUw20 i{Jr .a. i 2.2.r.b£q 


p 


M00OjjjO0D:H12 


CH09LNL 


JV I 


T71~* AA 


K 1 AU'JvU-O /4r.C.J I . I .r.oC0| 


c 

r 


M000j9126D:A08 


CH09LNL 


707 


J /9j 18 


K 1 AUOOU^Oo j r .n. 1 o.2.r.i>eq 


F 


» ^ /> /\ /-y | A A «4 | Lift 

M0004007I B:A10 


CH09LNL 


j9j 


** to t r\ 
j7Sj 19 


R 1 AOO'JO-oo I r .K.U/.2.r.5eq 


F 


MOOOj9S90A:HO^ 


CH09LNL 


J94 


-» T ,1 £.r\0 

j /4608 


R I AU0UU26/ jr.g.20. l.P.beq 


F 


M000j92j0A:A10 


CH09LNL 


JVJ 


j74j2o 


R 1 A0Q0'J-o/jr.c.24.2.P.5eq 


F 


MOOO.^9061 B:F08 


CH09LNL 


j9t) 


j74j2S 


r> -T- 4 r\/\ -) ^ -» r— j <-\ i Trie _ 

RTA000O26/ jr.d.Ol 2.P.Seq 


F 


M000j906l B:F08 


CH09LNL 




no t a 1 

428401 


r> -p * AAAA7££7C U. r\ — 1 n C 

K 1 AUOUUJoo / r .b.O / . 1 .P.Seq 


F 


M000j272^C:F06 


CH08LNH




1 j6202 


K I AOfJUUJoo / r.p.Uj.2.P.5eq 


F 


MOOO-i0j49D:B09 


CH14EDT 


100 


j 74 j 94 


RTA00«JU_o / jr .c. I j. 1 .P 5eq 


F 


M000j90^9C;G08 


CH09LNL 


,1AA 

4UU 


j / 784 


R I AU0*JU2 /Oor C. 1 / .LP.beq 


F 


MOOO0j816D:t 1 1 


CH01COH 


4U 1 


, 70101 


R I AUUUU-oo 1 r .n. 1 1.1 .r.beq 


F 


MO00j98y6D:H09 


CH09LNL


den 


1 oDooj 


d x \ nrvnro 7nc « t77 dc 
R 1 AwllU. / I2r.p. 1 / .2.r.beq 


c 
r 


M000- f 1 / 8B:G09 


CH04MAL 


jot 

4Uj 


14366 


DX \ AAAA"*7AQC A 1 1 1 D C 

R 1 AOUOU- /Uy r .a. 1 4. 1 .r.beq 


F 


* 1 AAA A ~ £ ~\ ^ l~\ t 

M0000:62jD:G 12 


i~ * 1 tA - ! /*AI f 

CH02COH 




joj JU2 


d x \ n rt nn7£" T n r l- at *> d 

R 1 AUUUUJo /Ur .K.U / .2.r.oeq 


c 

r 
CH09LNL


.iO> 


I j4oj 


R 1 AUUUU-. /UVr .1. 1 o. 1 .P.beq 


F 


MOOOObbj /C:G0^ 


CHO-COH 


dOA 


7 1 7T,t 
2 12 /4 




p 


\ | A AA A T 1 A 1 * . t~5 <"\ A 
I J /4j
4Uo
c
r 


JVIOOO- .'SjoQ:r 12 


CH04MAL | 


dOQ 


4-1 0 J 04 


DX A 00007 AA7F r- OQ 1 P Q^i 
R I r\UUUU_00 / r .CUV. 1 . r.icq 


c 
H-IAAA^IT* TO CAA


f— * i Art f v f I I 

CHOSLNH | 


d 1 0 


4 j I o2y 


DT A 0000">AAQC 1 1 A 7 D Q-»/i 


c 
r 


& A A A A ■* " T7 £ D . A O 

MOOO j j 2 7 6 B . O 0 o 


CHOSLNH | 


ill 1 
41 1 


J / 7 / j4 


DXAOnOO^^Q^C h OQ 1 D 


c 
r 


ft 1 AA A "* HA O "* T~\ ■ \ A C 

MU00 j ^98 j U . A 06 


/~* l I A(\I VI I 

C H09LNL | 


d 1 7 

41. 


4 J lOU 1 


DT AOOOO^^AOF L' 09 7 p Cp. n 


c 
r 


\,f AAA ''"•^^"•D,^ — A I 


f~" If A O 1 VIU 1 

LHUoLNH | 


*T 1 J 


"57n7JO 
J / J /4V 


DTAOOOO^A^OF f 7 p 


c 
r 


MUUUj w / V^L/:OUo 


LHU^LNL | 


d 1 d 


J / 0 /04 


DTA OOOO^ A QtF i OJ 7 p C-n 


r 
P 


liifAAA "* AO \? 1 \ l_I 1 t 

(ViUUUj ,lJ So4 A. ri I I 


r* UAA I v ; i 

C H09LNL 


4 1 j 


2 1 joUD 


RTA00002664F.i.20.1.P.Seq


F 


M0002 64 , C:D0.' 


CH04MAL 


416 


376144 


RTA00002675F.j.09.1.P.Seq


F 


M00039241A:E11


CH09LNL 


417 


373071 


RTA00002670F.j.23.2.P.Seq


F 


M00033442A:D06 


CH09LNL 


418 


379684 


RTA00002681F.c.09.2.P.Seq


F 


M00039851B:G11


CH09LNL 


419 


379610 


RTA00002680F.k.11.2.P.Seq


F 


M00039815C:F09


CH09LNL 


420 


22392 


RTA00002708F.a.10.1.P.Seq


F 


M0O00! 395 D:K02 


CH01COH 


421 


377555 


RTA00002683F.l.08.2.P.Seq


F 


M00040085D:E04


CH09LNL 


422 


32624 


RTA00002713F.f.15.1.P.Seq


F 


M0O0:~347C:GO~ 


CH04MAL 


423 


375024 


RTA00002675F.p.12.1.P.Seq


F


M00039266D:F12


CH09LNL
SEQ
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID


LIBRARY 


424 


374725 


RTA00002673FY.02. 1 .P.Seq 


F 


M00039070D:C02 


CH09LNL 


425 


376228 


RTA00002676FT. 1 9.2. P.Seq 


F 


M00039293A:H04 


CH09LNL 


426 


375906 


RTA00002675F.i.18.1.P.Seq


F 


M00039238D:A08 


CH09LNL 


427 


136190


RTA00002714F.a.04.1.P.Seq


F 


M00027729D:H06 


CH04MAL 


428


57694 


RTA00002713F.f.02.1.P.Seq


F 


M00027319D:B11


CH04MAL 


429 


7007 


RTA00002709F.d.08.1.P.Seq


F 


M00005614B:B01


CH02COH 


430 


400084 


RTA00002685F.o.19.2.P.Seq


F 


M00039641C:D07


CH12EDT 


431 


375643 


RTA00002676F.h.18.2.P.Seq


F 


M00039301B:F06 


CH09LNL 


432 


166493 


RTA00002663F.h.08.1.P.Seq


F 


M00022492C:A02


CH03MAH 


433 


379632 


RTA00002682F.h.14.1.P.Seq


F 


M00039934B:G12 


CH09LNL 


434 


373234 


RTA00002676F.g.15.2.P.Seq


F 


M00039297C:H03 


CH09LNL 


435 


401230 


RTA00002685F.i.05.2.P.Seq 


F 


M00039533A:C12


CH12EDT 


436 


186623 


RTA00002712F.f.15.1.P.Seq


F 


M00026843B:D10


CH04MAL 


437 


127714 


RTA00002712F.k.14.1.P.Seq


F 


M00027018A:C09 


CH04MAL 


438 


451357 


RTA00002692F.a.01.1.P.Seq


F 


M00042584B:C10


CH15CON


439 


404620 


RTA00002687F.c.03.2.P.Seq


F 


M00039770A:G11


CH14EDT 


440 


186872 


RTA00002663F.k.23.1.P.Seq


F 


M00022797B:G08 


CH03MAH


441 


42729 


RTA00002709F.c.06.2.P.Seq 


F 


M00005458A:F11


CH02COH 


442 


373380 


RTA00002674F.b.07.1.P.Seq


F 


M00039123A:B10


CH09LNL 


443 


374465 


RTA00002673F.c.07.1.P.Seq


F 


M00039058C:H02 


CH09LNL 


444 


403557 


RTA00002687F.d.10.2.P.Seq


F 


M0003994SA:E03 


CH14EDT 


445 


16749 


RTA00002709F.b.14.2.P.Seq


F 


M000054023:F08 


CH02COH 


446 


375592 


RTA000026S0F.f.~>2.2.P.Seq 


F 


M00039795D:E10


CH09LNL 


447 


376103 


RTA00002676F.g.06.2.P.Seq


F 


M000392953:D03 


CH09LNL 


448


40228 


RTA00002712F.l.13.1.P.Seq


F 


M00027049B:F05 


CH04MAL 


449 


374606 


RTA00002673F.j.23.1.P.Seq


F 


M0003W6A:A05 


CH09LNL 


450 


378270 


RTA00002680F.h.08.2.P.Seq


F 


M00039801A:H11


CH09LNL 


451 


236321 


RTA00002663F.k.14.1.P.Seq


F 


M00033034C:F02


CH08LNH 


452 


378676 


RTA00002680F.m.20.2.P.Seq 


F 


M00039827B:F07


CH09LNL 


453 


373252 


RTA00002670F.k.16.2.P.Seq


F 


M00033451A:H01


CH09LNL 


454 


384601 


RTA00002670F.k.06.2.P.Seq


F 


M00033446C:G08


CH09LNL 


455 


403772 


RTA000026S7F.3.03.2. P.Seq 


F 


M00039746C:G09


CH14EDT 


456 


379566 


RTA00002683F.k.04.1.P.Seq


F 


M00040081C:E01


CH09LNL 


457 


136202 


RTA00002687F.p.05.1.P.Seq


F 


M00040349D:B09 


CH14EDT 


458


14317 


RTA00002713F.c.13.1.P.Seq


F 


M00027243A:C02


CH04MAL 


459 


375349 


RTA00002672F.j.11.1.P.Seq


F 


M000.- L >024B:B10 


CH09LNL 


460 


403020 


RTA00002637F.a.02.2.P.Seq 


F 


M000;o-46C:A08 


CH14EDT


461 


374060 


RTA00002672F.j.07.1.P.Seq


F 


M0003 ll O!4B:C04 


CH09LNL 


462 


133399 


RTA00002712F.o.10.1.P.Seq


F 


M00027136C:C09 


CH04MAL 


463 


373789 


RTA00002671F.c.20.1.P.Seq


F 


M00038259B:G08


CH09LNL 


464 


20168 


RTA00002711F.b.22.1.P.Seq


F 


M000::S.-4B:G1 1 


CH03MAH 


465 


452641 


RTA00002692F.d.05.2.P.Seq


F 


M000-^003C:D08 


CH15CON


466 


431370 


RTA00002660F.rn.0-:. 2. P.Seq 


F 


M000J;:SSB:DI2 


CH08LNH 


467 


153044 


RTA00002713F.j.03.1.P.Seq


F 


M0O0:~4~6A:CO9 


CH04MAL 


468 


378229 


RTA00002679F.c.16.2.P.Seq


F 


M000;°bo3C:G09 


CH09LNL 


469 


374328 


RTA00002673F.d.01.1.P.Seq


F 


MOOO.^OolBiFOS 


CH09LNL


470 


39606 


RTA00002713F.i.20.1.P.Seq


F 


M00O:~46SA;CO9 


CH04MAL 


471 


59077 


RTA00002713F.n.01.1.P.Seq


F 


M00027596C:E06


CH04MAL 


472 


1935 


RTA00002710F.b.11.1.P.Seq


F 


M00008006B:B03 


CH03MAH 


473 


379684 


RTA00002681F.c.09.1.P.Seq


F 


M00039851B:G11


CH09LNL 


474 


451564


RTA00002691F.f.12.2.P.Seq


F 


M00043411D:H06


CH17COHLV 


475 


7571 


RTA00002710F.a.15.1.P.Seq


F 


M00007943D:C09 


CH03MAH 


476 


P93"»3 


RTA00002713F.k.21.1.P.Seq


F 


M00027525B:D06 


CH04MAL 


477 


P960 


RTA00002710F.a.23.1.P.Seq


F 


M00007976A:C10


CH03MAH 


478 


186730 


RTA00002713F.o.05.1.P.Seq


F 


M00027641C:A03


CH04MAL 


479 


59077 


RTA00002713F.m.24.1.P.Seq


F 


M00027596C:E06


CH04MAL 


480 


1 8*884 


RTAO000°7PF b 06 1 P Sea 


F 


M000^3316CG08 


CH04MAL 


481 


19471 


RTA0000°708F <» 08 1 P Seq 


F 


M00004197B:H10


CH01COH 


48^ 




RTA00007 7I0F c 06 1 P Sea 


F 


M00008063 B ■ A06 


CH03MAH 

Nil A 1 w w* * T A ■ ■> ■ A 


48*5 


404 ">S 7 


RTA0000°687F <» 06 0 P Sea 


F 

■ 


M00040708 A C03 


CH 14EDT 

V_fi 111* Am A 


4X4 


177997 


RTA 0000^679 F n 04 I P Sea 


F 

1 


tM 000 397^9 A* A 10 


CH09LNL 


dXS 

HO J 




RTA0000^7I >F k 16 \ P Sea 


F 


MOOO"'75' , OA C05 


CH04MAL 


HOO 




dt AOOOO^ftRlF h n "* P Sea 
iv 1 rtuuw-oo ' r .11. 1 J— .r 


r 




ru 14FDT 


tlft7 


Ax") 1 O/l 




F 


M0n04" , 9S^ A * F06 


v» r • 1 0 v_ \_/ 1 t 


d5l8 


9dOU 
— 4UJ4 


RTA00nn77 1 OF h 06 1 P Sda 


F 


lU000n799^r F06 


CH0 »iVf AH 


13 7 




RTAOnftO^ASOF ^ IS 1 P Sea 


f 




CH 1 ^CON 


H7U 


dO 1 977 
H*J 1 O / » 


RTAOnnn^ARAF r 1 P Sea 


f 


M00040 1 4 1 D - FOS 
I'lvuu^u i4iu,r v/j 


CH 1 iEDT 


dQ I 


7/: i -c ■? 
J /Ojjj 


RT4nflftn''ft74F a 1 9 ** P Sea 


F 
■ 


M0n0*91 i9 A T09 


CH091 NI 


dQ7 

47- 


4jjUj I 


RTAfinOfT'^OUF t 07 I P Sea 


F 


M0004^S9< A • A 1 I 


CW0COHLV 


dQ> 


I £7A0 
1 o /ou 


RTAOOOO^TOSF i 0 3 I P Sea 


F 


M00004193 R- F07 


CH0 ICOH 


dQ4 


1 74 1 74 

J / H \ { H 


RTAOOOO^hT^F i 1 ^ n P Sea 


F 
i 


MO0O >90l < A DO 7 


CH09LNL 


49 S 


^7478"; 


RT 40000^67^ F k ^ I ^ P Sea 


F 


M000^9030BEO° 


CH09LNL 


496 
47U 


^7S777 


RTAOftOO^^S 1 F n ^4 1 P Sea 


F 


M000 9909C • Ct 0 5 


CH09LNL 


497 


•tTA 1 t 7 

J f OH 1 / 


RTA0O0Q°678F t 03 n P Sea 


F 


M0Q0"9477n- A 1 Q 


CH09LNL 


498 




RTA0000^666F 0 0^ I P Sea 


F 


M000">767SCD06 


CH08LNH 


499 


>9409R 


RTAOO0O" ) 68IF i 15 I P Sea 


F 

1 


MOOO' ; 9887C'E07 


CH09LNL 


son 


179761 


RTAOO00 n 670F n 03 1 P Sea 


F 


M000^3561C- AO 0 


CH09LNL 






RTA0O00" , 674F i 08 I P Sea 


F 


MOOO>9144CE06 


CH09LNL 


J U— 


J / — 74U 


RTA0000" , 670F 1 07 1 P Sea 


F 


MO00""4^7D- A0 S 


CH09LNL 


503 


2^8909 


RTA00002664F.e.08.1.P.Seq


F 


M0002708>C:EI I 


CH04MAL 


504 


H — / 


RTA0000^66^F e 05 1 P Sea 


F 


M000^83MD A03 


CH08LNH 


505 


380413 


RTA00002680F.k.19.2.P.Seq


F 


M00039816C:D05


CH09LNL 


506 


373866 


RTAOO0O"'67IF c.24 "*.P Seq 


F 


M00038259C:H09 


CH09LNL 


507 




RTA0000^66^F 15 I P Sea 


F 


M000^86PCAP 


CH08LNH 


508


373000 


RTA00002670F.j.15.1.P.Seq


F 


M00033437C:C03 


CH09LNL 


509 


378838 


RTA00002678F.p.11.2.P.Seq


F 


M00039637C:A10


CH09LNL 


510 


24945 


RTA00002710F.p.05.1.P.Seq


F 


M00022739A:B03 


CH03MAH 


511


20277 


RTA00002710F.e.17.1.P.Seq


F 


M00021972D:C11


CH03MAH 


512 


20820 


RTA00002710F.e.02.1.P.Seq


F 


M00021919C:A10


CH03MAH 


513 


376791 


RTA00002674F.l.17.2.P.Seq


F 


M00039166B:G06


CH09LNL 


514 


9809 


RTA00002710F.g.12.1.P.Seq


F 


M0002217SB:D06 


CH03MAH 


515 


429562 


RTA00002667F.m.03.1.P.Seq


F 


M00032S55D:G12 


CH08LNH 


516 


12920 


RTA00002710F.e.15.1.P.Seq


F 


M00021964C:E10 


CH03MAH 


517 


377565 


RTA00002684F.h.19.1.P.Seq


F 


M00040309A:E11


CH09LNL 


J IO 


4707, *i6 

*+*£ 7 J JO 


dt A0OO0^665tF rl 1 P *x.»n 


r 


(VIUUUJ-VJ J A.v- iu 


C LI AO 1 VI 1 1 




477A7J 


RT AOOOO"'66 , >F f 09 1 P *Nf-n 


c 

r 


YrinAfl791AQ,rV FA51 
iVIUUUJo JOVLz. CU5 


C LI AO 1 VI U 


■J — V 


4777 1 » 


RTAfin00" , 66>F £ "*'» 1 P <x,»a 


C 

r 


\ji A A A7 9 1 • C A 9 
fVlUUU— o J0*+v. .OUO 


C LI AO l x r i j 


S7 1 


7776A7 


R T Afinft0767JF H 1 S 7 p c,, n 


c 

r 




LJ AO 1 VII 


S77 




dta nn00767-lF a 14 1 P ^.»n 


r 


MAAAIO 1 OAZ3-UAA 

iviuuu j y i y o o . nuo 


fLIAOI Vlf 


S7"? 


47Q7.6 1 


RTA00n0 n 666F H 1 1 1 P 


r 


KA A A A 1 7 % > A Pk - C A7 
IVIUUU J -3 3 UU.LUi 


riJACl Vf LJ 

LnUoLpJH 


^74 


1 -O / 3**> 


dta nnno">66 if i 16 i p *x»»n 


r 


\A AAAA9A^I < A * LI A 7 
lVIUUUUoU43 A.rtU- 


C LI A *! \A A LI 

v- nUJivlAri 


J — J 


il7 9A*l7 




r 


iviuuu j 1 4 1 / l . uuy 


^ UA9 1 X ) LI 


J-O 


1 9 9 At 
1 350-7 


DTA AfinO "> 7flQ F H K 1 P Q^n 


c 
r 


\A AAAAC£1 •! \ .r"AO 

iVlUU003 t>Z 3 A . v_ vl 


C LI r\^i~T\\A 

LHUiCvjH 


J— / 


■? 7A7A i 


DTAnnnn7A7nF n oi 7 p 


c 

r 


fcylAAA ^ ^ A 1 • \ A 7 

|ViU00jj3O I L . AUz 


/™*LIAAT Xfl 


x79 

7_£o 


/1AJA7 

404U / 


dt a rinno">66^r r \C\ \ X> 


r 

r 


\A AAA7 O t AA \A^ 

M 0002 S I V 0 L) . A 0 j 


/~LIAOI XILI 




Z 1 J03 


DTAAnnn7 7rtQF Ir 06 1 D 


r 


\.<AAAA7A 1 7 . LI AO 

ivlOOOO/0 1 JU:H05 


f ii 1 1 
LH02COH 


3_>U 


42 /400 


K I AUuUU-OD3r .0. 1 1 . 1 .r.oeq 


r 


h 4AAAH O 1 O 1 P\. /** 1 A 

M000-X l54L?:0 10 


r 1 u ao i x i li 


J J 1 


A AA7 £ -C 

4U0203 


D T \ AOnOlAQ < C r* f\1 ") D C 

K 1 AUUUUZoisjr .C.Uj.-.r.ocq 


r 


%. A AAA " ft.-* *T 1 O.OA1 

M000j9j74r3:B07 


C H 1 2 E DT 
J J-


*» o a a c ^ 
D T \ AArkrt7K9fiC -» 1 A 1 D C .„

K I AUUUU-ioJSUr .a. 1 o. J.r.beq 


c 
r 


M000j9/ / jD:F 1 1 


LJ A A I VII 

C H09LNL 


•*7 7 
J J J 


j/3j24 


D T \ AAAA"? A7Q C 1 1 ~i 7 D C 

K I AU>JUU-0 / or .1. 1 -i.J.r.bcq 


c 
r 


iVi000j96123:B 10 


/^LIAAI Vfl 

CH09LNL 


3->4 


23 163 


D T \ AAAA1 7 I AC |, 17 | DC 

R I A000U- / 1 Ur K. 1 / . 1 .r.beq 


r* 
M000224963:E 12
Aft 1

40 1 296 
M000j9329lD0/


CH 12EDT 


C~* il 


"** A tAAO 

_> 94098 


OT \ AAAA"»ilO 1 C i 1 i "> OC^^ 

RTA0U00.1OO 1 r j. 1 3._.r.beq 


F 


MO0Oj988 - C:E0/ 


CH09LNL 




174j0 


K 1 A0U0UJ ; I Or .1. 1 1.1 .r.oeq 


r 


M00022j63U:A0j 


f f/V ■*** k * * f f 

CHOjMAH 


C "* o 




K 1 A000O-D /4r .a.Uo. 1 .r.beq 


F 


M000j9I2 / A:G 1 1 


CH09LNL 


3j9 


"* TO z to 


KTAOOOO-0 /Jr.g. 1 4.J.r'.beq 


r 


M000j9004 3:L 1 1 


/"* I IAA I Vfl 

CH09LNL 


340 


2226/9 


K I A0000ioo4r r. 1 o._.r.bcq 


F 


M000- / 22 8 D: AO I 


CH04MAL 


C ,1 I 


_> 76874 


DT \ AAA/P£TAC j ~> 1 D C -v ^, 

K I AUU0U-O /Or .e._ j.J.r beq 


r 


MOOO^ j j 7> A : u 04 


/~- | f AA* V*I 


34 1 


2 1 j29 


DT A AAAAT7AOC k AQ t D C -v^, 

K 1 AUUUUJ /UVr.D.U5. 1 .r.o^q 


r 

r 


M00003->79A: t04 


/"* LI A 7 r\ LI 




1 19903 


DT \ AAAA*)"' t AC n I 1 I D C^^i 

K 1 A000UJ / 1 Ur .p. 1 j. 1 .r.bcq 


c 
r 


HAAA1110;r (~* A c 

M000-2753L :O0o 


/^LIA"*\f \ 11 

LHOjivIAH 


344 


.> /702o 


DT,\ AAAA7A79C n 7 t 7 D 

K 1 AOUuUJO / 5r .n._ 1 ._.r bcq 


c 
r 


\ (AAA ~ f\C "* 1 \ . 1 A 

iV1000j96j (ALIO 


UAAl xil 

CHOyLNL 


343 


J /j_>3 1 


D T A AAAA"* AT I C 1 1 Q t D C 

K 1 AOUOU-O / 1 r .1. 1 5. j.r.ieq 
r




t_i /\o i xfl 
r


\ ,i aaa ■* ~* c 7 a rn A 
LHUyLiNL


33 1 
.SJAAA^^OAOD./™ i A

ivIUOU j - oOo B . U 1 0 
33*+


3 /30JJ 


D T A AAAA"*A" T 7F m AS "> P 

rv t AUUUU— o i / r .m.uj — r ocq 


r 


X <t AAA"* O.I t T □ CAI 

tvlUUU. , 941 /D ^UI 


LI AO 1 Xfl 


■c *c ^ 
333 
punni xn


556 


1761 96 


RTA00OO" , 674F m P 1 P Sea 


F 


M00fl*Q170C r 0S 




557 


375115


RTA00002673F.d.24.2.P.Seq 


F 


M00039066D:G08


CH09LNL 


558


375115


RTA00002673F.e.01.2.P.Seq


F 


M00039066D:G08


CH09LNL 


559 


378600 


RTA00002679F.i.03.1.P.Seq 


F 


M00039686C:E06


CH09LNL 


560 


375351 


RTA00002680F.e.15.1.P.Seq


F 


M00039792A:B04


CH09LNL


561 


25237 


RTA00002710F.n.23.1.P.Seq


F 


M00022671B:A08 


CH03MAH 


562 


193503 


RTA00002663F.n.15.1.P.Seq


F 


M00023039D:B05


CH03MAH 


563 


428268 


RTA00002667F.b.01.1.P.Seq


F 


M00032724A:C05


CH08LNH


564 


379440 


RTA00002683F.j.21.2.P.Seq


F 


M00040080C:C06


CH09LNL 



565



566 



567 



568 



569 



570 



571 



374502 



240615



79207 



427893 



377530 



429707 



427610 



RTA00002673F.i.03.2.P.Seq



M00039030CH06 



CH09LNL 



RTA00002672F.e.l9.2.P.Seq 



M00038995D:E05 



CH09LNL 



RTA00002670F.b.07.2.P.Seq 



M00033306D:G08 



CH09LNL 



RTA00002665F.k.19.1.P.Seq



M00031419D:C04



CH08LNH 



RTA00002684F.g.l9.2.P.Seq 



M00040305A:D11



CH09LNL 



RTA00002668F.c.11.1.P.Seq



M00032918C:B10



CH08LNH 



RTA00002665F.l.04.1.P.Seq



M00023770A:D04



CH08LNH 



572 



573 



100699 



RTA00002662F.b.22.2. P.Seq 



M00006680B:D02 



CH02COH 



378974 



RTA00002682F.m.21.1.P.Seq



M00040017A:C06



CH09LNL 



574 



575 



576 



577 



373607 



RTA00002674F.d.15.1.P.Seq



M00039127D:E10



CH09LNL 



262951



RTA00002665F.d.04.3.P.Seq 



M00028215D:F03



CH08LNH 



30748 



RTA00002713F.e.11.1.P.Seq



M00027301B:B08



CH04MAL 



161116



RTA00002714F.c.11.1.P.Seq



M00027837C:D09



CH04MAL 



578 



379211



RTA00002682F.p.20.1.P.Seq



M00040029A:G04 



CH09LNL 



579 



580 



430689 



RTA00002669F.L24. 1 .P.Seq 



M00033243B:A05 



CH08LNH 



374122 



RTA00002673F.I.22.2.P.Seq 



M00039104D:C09



CH09LNL 



581 



376521 



RTA00002677F.h.06.2. P.Seq 



M0OO3939SA:BI0 



CH09LNL 



582 



583



372334 



RTA00002670F.b.12.2.P.Seq



M00033308B:G05 



CH09LNL 



379014 



RTA00002682F.o.02.1.P.Seq



M00040022C:D06



CH09LNL 



584 



585 



586 



376344 



RTA00002677F.b.18.2.P.Seq



M00039340B:G08 



CH09LNL 



376435 



RTA00002676F.f.01.2.P.Seq



M0003928SC:BI I 



CH09LNL 



21661 



RTA00002709F.e.18.1.P.Seq



M00005820C:E04 



CH02COH 



587 



588



589 



590 



376539 



RTA00002675F.b.18.1.P.Seq



M00039211A:C12



CH09LNL 



431645 



RTA00002669F.h.15.3.P.Seq



M00033223B:H07



CH08LNH 



163293 



RTA00002714F.c.20.1.P.Seq



M00028120D:F12



CH04MAL 



178614



RTA00002713F.c.20.1.P.Seq



M00027263A:F10



CH04MAL 



591 



373274 



RTA00002670F.i.22.2.P.Seq



M00033432B:H10



CH09LNL 



592 



379820 



RTA00002679F.f.15.1.P.Seq



M00039677A:B05 



CH09LNL 



593



160536 



RTA00002663F.f.10.1.P.Seq



M00022233C:A12



CH03MAH 



594 



j7jj I j 



RTA00002671F.m.02.1.P.Seq



M00038328D:A03



CH09LNL 



595 



596 



597 



26429 



RTA00002712F.k.23.1.P.Seq



M00027022D:G11



CH04MAL 



17983 



RTA00002711F.f.10.1.P.Seq



M00022979A:D05



CH03MAH 



375388 



RTA00002681F.j.22.2.P.Seq



M00039888B:D03



CH09LNL 



598 



599 



600 



63005 



RTA00002712F.m.21.1.P.Seq



M00027094A.-BO: 



23030 



RTA00002709F.b.10.2.P.Seq



M00005384A:C11



372946 



RTA00002670F.l.07.2.P.Seq



M00033457D:A05 



CH04MAL 



CH02COH 



CH09LNL 



601 



375351 



RTA00002680F.e.15.2.P.Seq



M00039792A:B04 



CH09LNL 



602 



603 



604 



605 



606 



607 



608 



374502 



RTA00O02673F.i.0S.I.P.Seq 



M000390S0CH06 



CH09LNL 



376911



RTA00002682F.e.09.1.P.Seq



M00039938C:A0S 



CH09LNL 



376024 



RTA00002675F.n.15.1.P.Seq



M00039257D:C03 



CH09LNL 



377194 



RTA00002679F.h.20.1.P.Seq



M000396S5A:A0S 



CH09LNL 



379643 



RTA00002682F.g.08.1.P.Seq



M00039978A:G03 



CH09LNL 



379610



RTA00002680F.k.11.1.P.Seq



M00039815C:F09



CH09LNL 



25613 



RTA00002711F.g.06.1.P.Seq



M00023024D:F12



CH03MAH 



609 



610 



611 



207466 



400052 



21290 



RTA00002664FJ.OS. I .P.Seq 



M00027733A:A02 



CH04MAL 



RTA00002687F.h.13.1.P.Seq



M00040291D:C05



CH14EDT



RTA00002712F.g.01.1.P.Seq



M00026859D:D01



CH04MAL 
612



613



614 



615



616 



375975 



46804 



69863 



375285 



373000 



RTA00002675F.n.18.1.P.Seq



RTA00002712F.n.19.1.P.Seq



RTA00002712F.i.18.1.P.Seq



RTA00002676F.g.18.2.P.Seq



RTA00002670F.j.13.2.P.Seq



M00039258D:B08 



CH09LNL 



M00027121D:C05



CH04MAL 



M00026931C:B04



CH04MAL 



M00039298B:B06 



CH09LNL 



M00033437C:C03



CH09LNL 



617



378679 



RTA00002681F.f.16.1.P.Seq



M00039869B:F06 



CH09LNL 



618 



45407 



RTA00002712F.k.11.1.P.Seq



M00027016A:B06



CH04MAL 



619 
620 



16838



RTA00002712F.e.23.1.P.Seq



M00026803A:F08 



CH04MAL 



186425



RTA00002713F.c.04.1.P.Seq



M00027236A:E04 



CH04MAL 



621



376485 



RTA00002676F.e.24.2.P.Seq 



M000392S8C:BII 



CH09LNL 



622 



41108



RTA00002712F.n.12.1.P.Seq



M00027I08C:B0: 



CH04MAL 



623 



430876 



RTA00002669F.c.02.1.P.Seq



M00033186C:D11



CH08LNH 



624 



185716



RTA00002713F.l.07.1.P.Seq



M00027537C:B01



CH04MAL 



625 



85338 



RTA000027l2F.b.lS.I.P.Seq 



M00023333D:C12



CH04MAL 



626



185597 



RTA00002713F.m.23.1.P.Seq



M00027596A:A10



CH04MAL 



627 



139348 



RTA00002713F.k.23.1.P.Seq



M00027526D:F0: 



CH04MAL 



628 



454665 



RTA00002693F.d.15.2.P.Seq



M00043164C:E12



CH19COP



629 



86387 



RTA00002713F.i.01.1.P.Seq



M00027528C:B10



CH04MAL 



630 



186387 



RTA00002713F.k.24.1.P.Seq



M00027528C:B10



CH04MAL 



631 



21093 



RTA00002708F.h.20.1.P.Seq



M0000430SC:C06 



CH01COH 



632 



20827 



RTA00002710F.c.23.1.P.Seq



M00021671D:F12 



CH03MAH 



633



21290 



RTA00002712F.f.24.1.P.Seq



M00026859D:D01



CH04MAL 



634 



17646 



RTA00002710F.d.22.1.P.Seq



M0002190SD:GI2 



CH03MAH 



635 



402817 



RTA00002686F.a.10.1.P.Seq



M00039736D:G08 



CH13EDT 



636 



42854 



RTA00002713F.n.09.1.P.Seq



M00027615A:F10



CH04MAL 



637 



430876 



RTA00002669F.c.02.3.P.Seq 



M00033186C:D11 



CH08LNH 



638 



378641 



RTA00002679F.a.21.2.P.Seq



M00039655CE0S 



CH09LNL 



639 



375848 



RTA00002674F.m.03.2. P.Seq 



M00039168C:A04



CH09LNL 



640 



16165 



RTA00002708F.L06. 1. P.Seq 



M00004340C:C07 



CH01COH 



641 



456506 



RTA00002694F.d.05.1.P.Seq



M000434^2A;EOI 



CH20COHLV 



642 



374450 



RTA00002672F.i.05.2.P.Seq 



M00039OUA:HI0 



CH09LNL 



643 



378949 



RTA00002683F.o.21.2.P.Seq



M00040100D:B06 



CH09LNL 



644 



j7jj 1 j 



RTA00002671F.m.02.2.P.Seq



M00038328D:A03



CH09LNL 



645 



377861



RTA00002681F.m.20.1.P.Seq



M00039898A:A08 



CH09LNL 



646 



431196



RTA00002669F.f.07.2.P.Seq 



M0003320-JB:A07 



CH08LNH 



647 



372795 



RTA00002683F.a.06.1.P.Seq



M00040032A:B03



CH09LNL 



648 



42340 



RTA 00002661 F.b.OJ. 1 .P.Seq 



M0000l4?oC;H06 



CH01COH 



649 



374410 



RTA00002674F.U U. P.Seq 



M000591<SB:GI2 



CH09LNL 



650 



4623 



RTA00002674F.a.01.2.P.Seq



M000391ISD:A06 



CH09LNL 



651 



431612 



RTA00002669F.e.23.2.P.Seq 



M00033201D:G06



CH08LNH 



652 



240615 



RTA00002672F.e.19.1.P.Seq



M00038995D:E05



CH09LNL 



653 



428508



RTA00002666F.d.01.1.P.Seq



M00032545B:H09 



CH08LNH 



654 



235780 



RTA00002666F.d.O?. I P.Seq 



M000325JfD:G05 



CH08LNH 



655 



17890 



RTA00002710F.e.11.1.P.Seq



M0002I9:5A:H02 



CH03MAH



656 



20100 



RTA00002710F.g.11.1.P.Seq



MO0022riD:D1.2 



CH03MAH 



657 



4458 



RTA00002710F.g.18.1.P.Seq



M00022IS-C:CI 



CH03MAH 



658 



378347 



RTA00002681F.h.07.2.P.Seq



M00039875D:A10



CH09LNL 
659


373477 


RTA00002672F.b.23.1.P.Seq


F 


M00058639B:C03 


CH09LNL 


660 


15596 


RTA00002710F.g.02.1.P.Seq


F 


M00022135C:B05


CH03MAH 


661 


21028 


RTA00002709F.I.09. LP.Seq 


F 


M00007108B:A02


CH02COH 


662 


374063 


RTA00002672F.h.15.2.P.Seq


F 


M00039011D:C10


CH09LNL 


663 


380686 


RTA00002684F.a.05.2.P.Seq


F 


M00040107B:H07


CH09LNL 


664 


402950 


RTA00002686F.g.11.1.P.Seq


F 


M00040181B:H09 


CH13EDT 


665 


428064 


RTA00002665F.I.04. 1. P.Seq 


F 


M00031485D:G02


CH08LNH 


666 


23310 


RTA00002708F.e.10.1.P.Seq


F 


M00004046C:A08


CH01COH 


667 


376233 


RTA00002677F.b.15.2.P.Seq


F 


M00039339C:F03


CH09LNL 


668 


375848 


RTA00002674F.m.03.1.P.Seq


F 


M00039168C:A04


CH09LNL 


669 


242251


RTA00002665F.L08. 1 .P.Seq 


F 


M00028772C:B09


CH08LNH 


670 


374064 


RTA00002672F.f.15.2.P.Seq


F 


M00038999D:C11


CH09LNL 


671 


146260 


RTA00002663F.d.17.1.P.Seq


F 


M00022099B:D06 


CH03MAH


672 


375575 


RTA00002677F.e.22.2.P.Seq 


F 


M00039385B:E09 


CH09LNL 


673 


355518 


RTA00002665F.c.15.3.P.Seq


F 


M00028201B:H12 


CH08LNH


674 


184223 


RTA00002662F.b.08.2.P.Seq


F 


M00005539D:G01


CH02COH 


675 


213306 


RTA00002664F.e.07.2.P.Seq 


F 


M00027078A:B02


CH04MAL 


676 


429566 


RTA00002668F.b.04.1.P.Seq


F 


M00032907A:G04 


CH08LNH


677 


378656 


RTA00002682F.c.09.1.P.Seq


F 


M00039927A:F04 


CH09LNL 


678 


427760 


RTA00002668F.e.23.1.P.Seq


F 


M00032940A:C02


CH08LNH


679 


372795 


RTA00002683F.a.06.2.P.Seq


F 


M00040032A:B03 


CH09LNL 


680 


429340 


RTA00002666F.f.12.1.P.Seq


F 


M00032577A:C04 


CH08LNH


681 


429822 


RTA00002668F.e.17.1.P.Seq


F 


M00032939B:E07 


CH08LNH


682 


375224 


RTA00002680F.d.22.2.P.Seq 


F 


M00039783B:A06 


CH09LNL 


683 


378347 


RTA00002681F.h.07.1.P.Seq


F 


M00039875D:A10 


CH09LNL 


684 


380109 


RTA00002682F.i.17.1.P.Seq


F 


M000399S"C:G08 


CH09LNL 


685 


379001 


RTA00002683F.o.02.1.P.Seq


F 


M00040097A:C12 


CH09LNL 


686 


375348 


RTA00002676F.i.12.5.P.Seq


F 


M00039304D:B09 


CH09LNL 


687 


377889 


RTA00002672F.c.08.2.P.Seq 


F 


M00038661A:A07


CH09LNL 


688 


429883 


RTA00002667F.g.05.1.P.Seq


F 


M00032793A:F06


CH08LNH


689 


377067 


RTA00002682F.l.24.1.P.Seq


F 


M00040014B:D01


CH09LNL 


690 


378001 


RTA00002681F.m.22.2.P.Seq 


F 


M00039898D:C06 


CH09LNL 


691 


45298 


RTA00002710F.j.21.1.P.Seq


F 


M00022455A:E02 


CH03MAH 


692 


375431 


RTA00002680F.f.03.1.P.Seq


F 


M00039793D:C05 


CH09LNL 


693 


377861 


RTA00002681F.m.20.2.P.Seq


F 


M00039898A:A08 


CH09LNL 


694 


428610 


RTA00002667F.c.09.1.P.Seq


F 


M00032766C:A04 


CH08LNH


695 


20765 


RTA00002710F.L 10. LP.Seq 


F 


M00022363C:G12 


CH03MAH 


696 


27601 


RTA00002713F.e.23.1.P.Seq


F 


M00027314C:D09 


CH04MAL 


697 


430540 


RTA00002668F.o.20.2.P.Seq 


F 


M00035140D:F06 


CH08LNH


698 


381024 


RTA00002670F.h.23.2.P.Seq


F 


M00033424B:A04


CH09LNL 


699 


16454 


RTA00002709F.L07. 1 .P.Seq 


F 


1X1000065^90:302 


CH02COH 


700 


372898 


RTA00002670F.i.03.2.P.Seq 


F 


M00033424D:H12


CH09LNL 


701 


373681 


RTA00002671F.d.20.1.P.Seq


F 


M00038272D:F11


CH09LNL 


702 


82260 


RTA00002684F.h.06.2.P.Seq 


F 


M0004030"B:F0I 


CH09LNL 


703 


377343 


RTA00002684F.g.04.2.P.Seq 


F 


M00040302C:A04 


CH09LNL 


704 


374747 


RTA00002676F.e.07.2.P.Seq


F 


M000392S6A:C06 


CH09LNL 


705 


185848 


RTA00002712F.m.11.1.P.Seq


F 


M000270SOA:B01 


CH04MAL 


706 


374311


RTA00002676F.e.18.2.P.Seq


F 


M000392S~C:A06 


CH09LNL 


707 


278923 


RTA00002667F.b.10.1.P.Seq


F 


M00032726C:C01


CH08LNH 


708 


378667 


RTA00002681F.b.11.2.P.Seq


F 


M0003984~A:F06 


CH09LNL 


709 


380454 


RTA00002673F.j.16.1.P.Seq


F 


M0003908-O:D07 


CH09LNL 


710 


381576 


RTA00002670F.i.04.2.P.Seq


F 


M00033425A:C10


CH09LNL 


711 


375067 


RTA00002675F.o.03.1.P.Seq


F 


M00039260C:G03


CH09LNL 


712 


89706 


RTA00002714F.a.11.1.P.Seq


F 


M00027741B:F09


CH04MAL 


713 


10583 


RTA00002711F.h.11.1.P.Seq


F 


M00023100A:E12 


CH03MAH 


714 


379982 


RTA00002682F.i.16.1.P.Seq


F 


M00039987C:E12


CH09LNL 


715 


378532 


RTA00002680F.n.04.3.P.Seq


F 


M00039828B:C05 


CH09LNL 


716 


379776 


RTA00002680F.a.22.2.P.Seq 


F 


M00039774C:A03


CH09LNL 


717 


374136 


RTA00002673F.f.16.1.P.Seq


F 


M00039072C:C03 


CH09LNL 


718 


98471 


RTA00002663F.J.2U. P.Seq 


F 


M00022670D:H11


CH03MAH 


719 


P5365 


RTA00002668F.j.07. 1 .P.Seq 


F 


M00033019B:E10


CH08LNH 


720 


375431 


RTA00002680F.f.03.2. P.Seq 


F 


M00039793D:C05


CH09LNL 




6*>S'>6 


RTA00002661F.g.20.1.P.Seq


F 


M00004105D:D05 


CH01COH 


722


379Q77 


RTA00002679F.e.10.1.P.Seq


F 


M00039672D:D10 


CH09LNL 


723


j 77554 


RTA00002679F.f.10.1.P.Seq


F


M00039675D:B03 


CH09LNL 


724


230479 


RTA00002664F.c.16.2.P.Seq


F


M00026915B:C06


CH04MAL 


725


9887^ 


RTA00002663F.j.19.1.P.Seq


F


M00022668B:B12


CH03MAH 


726


4^635 


RTA00002679F.h.18.1.P.Seq


F


M00039634D:B08 


CH09LNL 


727


179044 


RTA00002679F.a.10.2.P.Seq


F 


M00039652B:D05 


CH09LNL 


728


96091 


RTA00002663F.j.07.1.P.Seq


F 


M00022640C:C12 


CH03MAH 


729


40 3 64^ 


RTA00002687F.d.01.2.P.Seq


F 


M00039945C:F09 


CH14EDT 


730 


400921


RTA00002685F.b.18.2.P.Seq


F 


M00039371B:H06 


CH12EDT 


731


93587 


RTA00002663F.k.10.1.P.Seq


F 


M00022731A:D02 


CH03MAH 


732 


TQQS 1 


RTA00002713F.c.18.1.P.Seq


F 


M00027258A:A07 


CH04MAL 


733 


176509 


RTA00002686F.b.09.1.P.Seq


F 


M00039756B:H06


CH13EDT


734 


451753 


RTA00002694F.e.06.1.P.Seq


F 


M00043634A:C10


CH20COHLV 


735 


1 86°66 


RTA00002713F.c.16.1.P.Seq


F 


M00027256B:H09 


CH04MAL 


736 


23505? 


RTA00002692F.a.15.2.P.Seq


F 


M00042626B:D08 


CH15CON


737 


377233 


RTA00002682F.e.23.1.P.Seq


F 


M0003 l »40D:GOS 


CH09LNL 


738 


378532


RTA00002680F.n.04.2.P.Seq


F 


M00039828B:C05 


CH09LNL 


739 


177932 


RTA00002713F.b.22.1.P.Seq


F 


M00027233B:C01


CH04MAL 


740 


9537 


RTA00002712F.p.13.1.P.Seq


F 


M00027179D:E06 


CH04MAL 


741 


240318


RTA00002687F.d.04.2.P.Seq


F 


M00039947A:D06 


CH14EDT 


742 


404260 


RTA00002687F.c.11.2.P.Seq


F 


M00039942D:C01


CH14EDT 


743 


93767 


RTA00002712F.g.09.1.P.Seq


F 


M0002686SC:E1 1 


CH04MAL 


744 


185642 


RTA00002712F.f.20.1.P.Seq


F 


M000_6356D:F02 


CH04MAL 


745 


447544 


RTA00002639F.e. 18.3 P.Seq 


F 


M00042905D:D02 


CH15CON


746 


403274 


RTA00002687F.b.10.2.P.Seq


F 


M0003 l )766A:G07 


CH14EDT


747 


404257 


RTA00002687F.g.06.1.P.Seq


F 


M00040203A:C03 


CH14EDT 


748 


403868 


RTA00002687F.k.05.2.P.Seq


F 


M00040318C:K! 1 


CH14EDT 


749 


450074 


RTA00002691F.e.12.2.P.Seq


F 


M000433*52D:C1 1 


CH17COHLV


750 


404520 


RTA00002687F.f.05.2.P.Seq


F 


M00040:02A:F05 


CH14EDT 


751 


451789 


RTA00002692F.b.04.2.P.Seq


F


M00042956C:B06 


CH15CON


752 


455178 


RTA00002694F.b.19.1.P.Seq


F


M00043447A:C07


CH20COHLV 
iVlUUlWUUV / A.L. 1 _


v. HU9LNL 


•7 c r 

OD 


j7476j 


RI AUUUU-0 / jr.p.-i. l.r.oeq 


r 


[VIUUUjVI lotJ.LUD 


C LIAAI VII 


/5o 


402^08 


OT \ AAAAOrfCOiCC' rt I < 1 D C Jn 

R I AUUUU-Ooor.O. 1 j . 1 .roeq 


c 
r 


.\,<AAA.I AT Ol Pi • Q A 1 

MUUU4Uio 1 V. dv I 


<- H 1 j fcUT 


/57 


4j 1 j70 


o t \ Art Ami iCnc m A,i l d 
R rAOUUUiooyr .m.U4.j .r.^eq 


c 
r 


HifAAA" 1 ^ O O O □ ■ Pi 1 *> 

MUUUj j2ood.U li 


Aunot v r it 

LHU8LNH 


/JO 


j80d00 


RTAUUUUio / Ur .p. 1 V. 1 .r\oeq 
L nUVLiNL
"T "» A"^ A

j7j020 


DT * AAAA1£*7 1 C U ^A T D 

R lAOUUUio / 1 r.D.JU.J.r.beq 


c 
h jnAn' 1 ">;a; 4 . r~ 1 ■


Aurtor vri 


-»s -» 
7oj 


j7d2j 1 


OT \ AAAA1/C7 1 C m OA *> D Ca« 

R 1 AUUUU_o / 1 r .m.-:U.i.r.beq 


c 
r 


\.<AAA , 0"'0' , D. \ A"7 


C LI AQ I VII 


764 


t £_ | OA 

16 ISO 


DT * AAAA"1"7AQC I 1 "7 1 D C 

R 1 AQU0U_7UVr.j. 1 /. 1 ,r beq 


r* 

r 


XjIAAAAZA"** A. 4 A"* 

M 000069 / 1 D: AOj 


L HU2LUH 


76} 


"?A 1 A^ 

j 7940 j 


RTAOOUO-oojr .c. I / .Z.r.beq 


r 


M0004004 1 C . L 09 


rMAAi vil 
v. HU9LNL 


766 


37dj82 


RTA00002677F.d.24.2.P.Seq


F 


M000j9jS 1 D:C02 


(-* If A A 1 VII 


767 


3796^j 


RTA00002683F.c.03.2.P.Seq


F 


M00040038D:G04


CH09LNL 


768 


377858 


RTA00002681F.e.14.2.P.Seq


F 


M00039864A:A07


CH09LNL 


769 


430861 


RTA00002668F.n.18.1.P.Seq


F 


M0O0j299^C:C0^ 


CH08LNH


770 


376128 


RTA00002677F.a.11.2.P.Seq


F 


M000j9j j4B:E0j 


CH09LNL 


771 


375009 


RTA00002676F.n.20.2.P.Seq


F 


M000j9j2_A:F04 


CH09LNL 


772 


429816 


RTA00002667F.n.22.1.P.Seq


F 


M000j28 / 1 D:E I ! 


CH08LNH 


773 


375657 


RTA00002681F.h.13.2.P.Seq


F 


M00039877C:C03


CH09LNL 


774 


427839 


RTA00002666F.D. 14. 1 .P.Seq 


F 


M000j2^j0D:C02 


CH08LNH


775


376761 


RTA0000j6/ /F.g.0j.2.P.Seq 


F 


M000j9j91 D:F03 


CH09LNL 


776 


44025 


RTA00002684F.b.24.2.P.Seq


F 


M00040115B:A04


CH09LNL 


777 


44025 


RTA00002684F.c.01.2.P.Seq


F 


M00040115B:A04


CH09LNL 


778 


392524 


RTA00002681F.p.04.2.P.Seq


F 


M00039909D:C02


CH09LNL 


779 


427252


RTA00002665F.b.13.1.P.Seq


F 


M00028185B:A06


CH08LNH


780 


374927 


RTA000026.' jF.e.l2J .P.Seq 


F 


M00039068C:E06


I IAA 1 VII 


781 


378226


RTA00002680F.g.09.1.P.Seq


F 


M00039797C:G05


CH09LNL


782 


217964


RTA00002664F.g.08.2.P.Seq


F 


M00027299B:B12


CH04MAL


783


j76j68 


RTA00002677F.b.14.2.P.Seq


F 


M000j9j--9A:H07 


lj a n r vrl 


784 


j777I9 


RTA00002677F.j.11.2.P.Seq


F 


M000j940 / B:O02 




785


j7808I 


RTAOQOO-6 / 7F.e. 1 6.2. P.Seq 


F 


M000j9jo4C: E02 


/^LIAOf VTI 


786 


89267 


RTA0000-662F.D.0 1 .2. P.Seq 


F 


M0000d44: D:B0 1 


At JAl f~\ LI 


787 


374927


O T % AAAA ■^Zl'' r* t "> T D 

RTA00UU-67jr.e. 1 2.2.P.beq 


r 


\ < AAA"* AA£ O - tT AiC 

ivl000^9UoSC . bUo 


LI AO f Ml 


"TOO 

788 


279054 


DT \ AAAA"»iCiCTC W ID C v^-. 

RTAOUUU-oo/r.D.Jj. \ .r.beq 


c 
r 


MUU0j-7j I B.C. I U 




789 


j7/28j 


RTAOUUO-ooir.m. 1 y. I .r.ieq 


c 
r 


hjf AAA IAA I C LI 1 1 

MU0U4UU 1 61. H 1 1 


r'uAOi v;i 


790 


4>.>I8 


RTA00002 .* 1 Or .1.0^ . 1 .P.Seq 


r 


M0002 2 5 j A : A 0 o 


TJA "* \ 1 \ LI 


791 


188292 


RTAUUUU_oo4r ,e. 1 j ...r.Seq 


c 
r 


\ IAAAT7l£^D CAi 






J /oo/- 


t\ 1 MUvvU— OuJr.C — U — r.jcq 


F 


M00040042B:AIO 


rHOOl Nil 


793 


427252 


RTA00002665F.b.13.3.P.Seq


F 


M00028185B:A06


CH08LNH


794 


330618 


RTA00002673F.j.12.2.P.Seq


F 


M00039084C:G07


CH09LNL 


795 


35646 


RTA00002667F.g.16.1.P.Seq


F 


M00032797B:G02 


CH08LNH


796 


46407 


RTA00002665F.c.10.3.P.Seq


F 


M00028196D:A03


CH08LNH


797 


373720 


RTA00002674F.c.04.1.P.Seq


F 


M00039124C:F03 


CH09LNL 


798 


429693 


RTA00002668FT.05.I. P.Seq 


F 


M00032944B:B02


CH08LNH


799 


377108 


RTA00002673F.p.04.2.P.Seq


F


M00039636C:D11


CH09LNL 



SEQ
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


800 


375657 


RTA00002681F.h.13.1.P.Seq


F 


M00039877C:C03


CH09LNL 


801 


374868 


RTA00002673F.d.08.2.P.Seq 


F 


M00039063B:D08 


CH09LNL 


802 


428716 


RTA00002667F.e.08.1.P.Seq


F 


M00032766B:D12


CH08LNH 


803 


44025 


RTA00002684F.c.01.1.P.Seq


F 


M00040115B:A04


CH09LNL 


804 


430327 


RTA00002668F.k. Ml .P.Seq 


F 


M00033033C:H01


CH08LNH 


805 


374328 


RTA00002673F.c.24.1.P.Seq


F 


M00039061B:F08


CH09LNL 


806 


376946 


RTA00002682F.n.10.1.P.Seq


F 


M00040019A:E01


CH09LNL 


807 


375522 


RTA00002677F.n.08.2.P.Seq 


F 


M00039420D:D03 


CH09LNL 


808 


395617 


RTA00002687F.b.15.2.P.Seq


F 


M00039767B:A04 


CH14EDT 


809 


21686 


RTA00002712F.g.05.1. P.Seq 


F 


M00026865B:A06 


CH04MAL 


810 


452038 


RTA00002692F.a.09.2.P.Seq 


F 


M00042623D:D07 


CH18CON 


811 


25632 


RTA00002711F.g.16.1.P.Seq


F 


M00023042D:D02


CH03MAH 


812 


152487 


RTA00002663F.e.12.1.P.Seq


F 


M00022131C:D01 


CH03MAH 


813 


378226 


RTA00002680F.g.09.2.P.Seq 


F 


M00039797C:G05 


CH09LNL 


814 


402446 


RTA00002686F.c.04.1.P.Seq


F 


M00040133B:B03 


CH13EDT 


815 


403642 


RTA00002687F.c.24.2.P.Seq


F 


M00039945C:F09 


CH14EDT 


816


186359 


RTA00002713F.g.24.1.P.Seq


F 


M00027379C:B07 


CH04MAL 


817 


404290 


RTA00002688F.e.04.2. P.Seq 


F 


M00040395B:D11


CH14EDT 


818 


375443 


RTA00002676F.g.19.2.P.Seq


F 


M00039298B:D03


CH09LNL 


819 


380279 


RTA00002673F.i.24.1.P.Seq


F 


M00039082B:A05 


CH09LNL 


820 


386110


RTA00002687F.e.06.1.P.Seq


F 


M00039955C:C04 


CH14EDT


821 


380279 


RTA00002673F.j.01.1.P.Seq


F 


M00039082B:A05 


CH09LNL 


822 


386986 


RTA00002675F.p.06.1.P.Seq


F 


M00039266A:B02


CH09LNL 


823 


186359 


RTA00002713F.h.01.1.P.Seq


F 


M00027379C:B07 


CH04MAL


824 


375611 


RTA00002677F.o.20.2.P.Seq 


F 


M00039425D:E12 


CH09LNL 


825 


378285 


RTA00002679F.h.01.1.P.Seq


F 


M00039681B:H09


CH09LNL 


826 


44025 


RTA00002684F.b.24.1.P.Seq


F 


M00040115B:A04


CH09LNL 


827 


25240 


RTA00002711F.c.12.1.P.Seq


F 


M00022854A:B03


CH03MAH 


828 


403700 


RTA00002687F.g.03.2.P.Seq 


F 


M00040207B:D08 


CH14EDT 


829 


404679 


RTA00002687F.f.07.1.P.Seq


F 


M00040203A:H06 


CH14EDT 


830 


454806 


RTA00002693F.b.12.2.P.Seq


F 


M00043093C:G11


CH19COP 


831 


376829 


RTA00002674F.f.21.2.P.Seq


F 


M00039135D:G02


CH09LNL 


832 


456309 


RTA00002694F.d.16.1.P.Seq


F 


M00043518B:D06 


CH20COHLV 


833 


374510 


RTA00002672F.U7.2.P.Seq 


F 


M00039015D:H04 


CH09LNL 


834 


377232 


RTA00002683F.m.08.2. P.Seq 


F 


M00040090B:G09 


CH09LNL 


835 


375779 


RTA00002672F.j.20.2.P.Seq


F 


M00039025A:H09 


CH09LNL 


836 


90746 


RTA00002671F.a.07.2.P.Seq


F 


M00033585D:A02 


CH09LNL 


837 


453002 


RTA00002692F.b.21.2.P.Seq


F 


M00042970C:H10 


CH18CON


838 


402863 


RTA0000" > 686F.n. P. 1 .P.Seq 


F 


M00040273B:H12


CH13EDT


839 


402526 


RTA00002686F.p.07.1.P.Seq


F 


M00040286C:C02


CH13EDT 


840 


412778 


RTA00002685F.i.07.1.P.Seq


F 


M00039533D:F04 


CH12EDT 


841 


402273 


RTA00002686F.j.18.1.P.Seq


F 


M00040233C:G05 


CH13EDT 


842 


374744 


RTA00002670F.i.16.1.P.Seq


F 


M00033427D:F01 


CH09LNL


843 


375764 


RTA00002677F.o.13.2.P.Seq


F 


M00039425C:G01


CH09LNL 


844 


428218 


RTA00002667F.c.01.1.P.Seq


F 


M00032731C:C07


CH08LNH 


845 


374809 


RTA00002675F.h.01.1.P.Seq


F 


M00039230D:D09 


CH09LNL 


846 


20162 


RTA00002710F.n.20.1.P.Seq


F 


M00022662D:G11


CH03MAH 


847 


375782 


RTA00002677F.d.23.2.P.Seq 


F 


M0003938;C:H08 


CH09LNL 


848 


372958 


RTA00002672F.c.02.1.P.Seq


F 


M00038639D:F07


CH09LNL 


849 


403940 


RTA00002688F.d.07.2. P.Seq 


F 


M0004038~D:H05 


CH14EDT 


850 


8490 


RTA00002711F.g.03.1.P.Seq


F 


M00023020C:G08


CH03MAH 


851 


374809 


RTA00002675F.g.24.1.P.Seq


F 


M00039230D:D09 


CH09LNL 


852 


377788 


RTA00002684F.g.24.2.P.Seq 


F 


M00040305C:H06 


CH09LNL 


853 


13847 


RTA00002711F.f.09.1.P.Seq


F 


M00022976C:F04 


CH03MAH 


854 


374172 


RTA00002673F.k.16.1.P.Seq


F 


M00039097D:D06 


CH09LNL 


855 


380314 


RTA00002682F.l.07.1.P.Seq


F 


M00040009D:B07 


CH09LNL 


856 


47231 


RTA00002714F.b.15.1.P.Seq


F 


M00027813C:F01 


CH04MAL 


857 


400287 


RTA00002685F.k.10.1.P.Seq


F 


M00039584C:C01


CH12EDT 


858 


400533 


RTA00002685F.a.02.2.P.Seq 


F 


M0003918!D:E05 


CH12EDT 


859 


447594 


RTA00002689F.c.07.1.P.Seq


F 


M00042696B:E05


CH15CON


860 


147357 


RTA00002711F.c.15.1.P.Seq


F 


M00022928B:C01


CH03MAH 


861 


401141 


RTA00002685F.o.2~>:2.P.Seq 


F 


M00039642D:B12 


CH12EDT 


862 


404620 


RTA00002687F.c.03.1.P.Seq


F 


M00039770A:GM 


CH14EDT 


863 


24360 


RTA00002709F.l.20.1.P.Seq


F 


M00007149A:G02 


CH02COH 


864 


380618 


RTA00002673F.j.12.1.P.Seq


F 


M00039084C:G07


CH09LNL 


865 


448446 


RTA00002690F.d.09.3.P.Seq 


F 


M00042797D:D10 


CH16COP 


866 


402313


RTA00002686F.f.18.1.P.Seq


F 


M00040174D:G03


CH13EDT 


867 


273151 


RTA00002685F.c.05.2.P.Seq 


F 


M00039374C:H02 


CH12EDT 


868 


404172 


RTA00002687F.d.17.2.P.Seq


F 


M00039951B:B12


CH14EDT 


869 


263630 


RTA00002694F.e.10.1.P.Seq


F 


M00043637C:H01


CH20COHLV 


870 


404277 


RTA00002687F.d.13.1.P.Seq


F 


M00039951B:C03


CH14EDT 


871 


403557 


RTA00002687F.d.10.1.P.Seq


F 


M00039948A:E03 


CH14EDT 


872 


375161 


RTA00002676F.m.24.2.P.Seq 


F 


M00039319B:H12


CH09LNL 


873 


376829 


RTA00002674F.f.21.1.P.Seq


F 


M00039135D:G02 


CH09LNL 


874 


372953 


RTA00002672F.c.02.2.P.Seq 


F 


M00038639D:F07 


CH09LNL 


875 


21578 


RTA00002709F.a.24.1.P.Seq


F 


M00005351C:G05


CH02COH 


876 


402506 


RTA00002686F.b.17.1.P.Seq


F 


M00039760B:BOS 


CH13EDT


877 


141731 


RTA00002713F.b.04.1. P.Seq 


F 


M000272iZD:E03 


CH04MAL 


878 


37411 


RTA00002661F.e.11.1.P.Seq


F 


M00003770A:E05 


CH01COH 


879 


372537 


RTA00002670F.c.05.2.P.Seq 


F 


M00033345D:A09 


CH09LNL 


880 


380834 


RTA00002670F.c.03.2.P.Seq 


F 


M00033346C:A05 


CH09LNL


881 


401492


RTA00002685F.n.17.2.P.Seq


F 


M00039609D:F07 


CH12EDT 


882 


99998 


RTA00002662F.b.23.2.P.Seq 


F 


M00006712C:H09 


CH02COH 


883 


404311


RTA00002688F.d.21.2.P.Seq


F 


M00040394A:D04 


CH14EDT 


884 


231084 


RTA00002664F.C. 1 S.2.P.Seq 


F 


M000269ISB:D0I 


CH04MAL


885 


447679 


RTA00002689F.b.11.3.P.Seq


F 


M00042560A:F12


CH15CON


886 


377012 


RTA00002682F.d. 1 / .I. P.Seq 


F 


MO00j99^6v_ :C0? 


CH09LNL 


887 


226207 


RTA00002664F.d.21.2.P.Seq 


F 


M00027035D:C06


CH04MAL 


888 


446183 


RTA00002689F.a.12.1.P.Seq


F 


M00042534A:A05 


CH15CON


889 


428508 


RTA00002666F.c.24.1.P.Seq


F 


M0003254:B:H09 


CH08LNH 


890 


157648


RTA00002714F.b.20.1.P.Seq


F 


M000278tSC:C07 


CH04MAL


891 


404609 


RTA00002688F.b.15.2.P.Seq


F 


M00040377C:G07


CH14EDT 


892 


400464 


RTA00002685F.l.10.1.P.Seq


F 


M00039590D:D02 


CH12EDT 


893 


379108 


RTA000026S5F.L12J. P.Seq 


F 


M0OOJ959iC:DO6 


CH12EDT 


894 


374639 


RTA00002676F.d.21.2.P.Seq


F 


M00039284D:B12


CH09LNL 


895 


380674 


RTA00002673F.j.14.2.P.Seq


F 


M00039084C:H04


CH09LNL 


896 


380674 


RTA00002673F.j.14.1.P.Seq


F 


M00039084C:H04 


CH09LNL 


897


188972 


RTA00002664F.d.20.2.P.Seq


F 


M00027030C:H06


CH04MAL 


898 


402835 


RTA00002686F.c.01.1.P.Seq


F 


M00040131D:G08 


CH13EDT 


899 


403774 


RTA00002687F.d.08.2.P.Seq 


F 


M00039947C:G03


CH14EDT 


900 


374606 


RTA00002673F.j.23.2.P.Seq 


F 


M00039096A:A05 


CH09LNL 


901 


192535 


RTA00002663F.m.14.1.P.Seq


F 


M00022925C:A08


CH03MAH 


902 


377926 


RTA00002680F.U 6.2.P.Seq 


F 


M00039820B:B06 


CH09LNL 


903 


186055 


RTA000027I2F.U 1.1. P.Seq 


F 


M00026926A:E10 


CH04MAL 


904 


380498 


RTA00002684F.f.11.2.P.Seq


F 


M00040129D:E10 


CH09LNL 


905 


400236 


RTA00002685F.i.18.2.P.Seq


F 


M00039561A:B07


CH12EDT 


906 


401070 


RTA00002688F.d.12.2.P.Seq


F 


M00040390A:H02 


CH14EDT


907 


452622 


RTA00002692F.b.14.2.P.Seq


F 


M00042962D:C05 


CH18CON


908 


235052 


RTA00002692F.a.15.1.P.Seq


F 


M00042626B:D08 


CH18CON


909 


452221 


RTA00002692F.c.13.2.P.Seq


F 


M00042936C:G12 


CH18CON


910 


404581 


RTA00002687F.g.11.2.P.Seq


F 


M00040203D:G09 


CH14EDT 


911 


376925 


RTA00002687F.e.14.2.P.Seq


F 


M00039957C:C09


CH14EDT 


912 


400237 


RTA00002685F.k.10.2.P.Seq


F 


M00039584C:C01


CH12EDT 


913 


403242 


RTA00002687F.l.05.2.P.Seq


F 


M00040323B:C12


CH14EDT 


914 


453313 


RTA00002693F.a.07.2. P.Seq 


F 


M00042614B:B05 


CH19COP


915 


452633 


RTA00002692F.L 1 1 .2. P.Seq 


F 


M00043067D:D10 


CH18CON


916 


447679 


RTA00002689F.b.11.1.P.Seq


F 


M00042560A:F12 


CH15CON


917 


452398 


RTA00002692F.f.17.1.P.Seq


F 


M00043125C:A11


CH18CON


918 


449797 


RTA00002691F.b.22.3.P.Seq 


F 


M00043334B:A10 


CH17COHLV 


919 


403916 


RTA00002687F.j.11.2.P.Seq


F 


M00040314D:H05


CH14EDT 


920 


236906 


RTA00002693F.d.05.2.P.Seq 


F 


M00043154A:B07 


CH19COP


921 


404161 


RTA00002687F.e.20.2.P.Seq 


F 


M0005995SCB09 


CH14EDT


922 


386110


RTA00002687F.e.06.2.P.Seq 


F 


M00039955C:C04


CH14EDT


923 


451512 


RTA00002691F.b.02.3.P.Seq


F 


M00043305B:G02 


CH17COHLV


924 


400517 


RTA00002687F.k.15.2.P.Seq


F 


M00040320D:F02


CH14EDT 


925 


403578 


RTA00002687F.i.01.2.P.Seq


F 


M00040296D:E09 


CH14EDT 


926 


403578 


RTA00002687F.h.24.2.P.Seq


F 


M00040296D:E09 


CH14EDT


927 


403371 


RTA00002687F.h.19.2.P.Seq


F 


M00040294D:D12 


CH14EDT 


928 


452531 


RTA00002692F.f. 16.1. P.Seq 


F 


M00043125A:B11


CH18CON


929 


454453 


RTA00002693F.f.15.2.P.Seq


F 


M00043215A:D02 


CH19COP 


930 


238270 


RTA00002692F.e.07.2.P.Seq 


F 


M00043023A:G05 


CH18CON


931 


14533 


RTA00002687F.f.0S.2.P.Seq 


F 


M00040:05B:A05 


CH14EDT


932 


400464 


RTA00002685F.l.10.2.P.Seq


F 


M00039590D:D02 


CH12EDT 


933 


404642 


RTA00002687F.f.02.2.P.Seq


F 


M00040201C:G11


CH14EDT 


934 


330413 


RTA00002680F.k.19.1.P.Seq


F 


M00039816C:D05 


CH09LNL 


935 


237963 


RTA00002693F.c.20.2.P.Seq


F 


M00045143C:A09 


CH19COP 


936 


20847 


RTA00002710F.d.09.1.P.Seq


F 


M000:iS52D:A05 


CH03MAH 


937 


456531 


RTA00002694F.b.18.1.P.Seq


F 


M00043446C:E12


CH20COHLV 


938 


450463 


RTA00002694F.a.12.1.P.Seq


F 


M00042596C:D07 


CH20COHLV 


939 


456713 


RTA00002694F.d.13.1.P.Seq


F 


M00043513D:G08


CH20COHLV 


940 


455503 


RTA00002694F.a.15.1.P.Seq


F 


M000-i25^7B:El2 


CH20COHLV 

941


376133


RTA00002674F.m.05.2.P.Seq


F


M00039169A:E12


CH09LNL


942 


402331


RTA00002686F.m.03.1.P.Seq


F 


M00040264D:G05 


CH13EDT


943 


373820 


RTA00002674F.d.06.2.P.Seq 


F 


M00039127A:G11


CH09LNL 


944 


85383 


RTA00002674F.c.06.2.P.Seq 


F 


M00039124C:H08 


CH09LNL 


945 


400732 


RTA00002685F.k.24.2.P.Seq


F 


M0003958"C:FI2 


CH12EDT 


946 


431629


RTA00002669F.U4.1.P.Seq 


F 


M00033276B:G08


CH08LNH 


947 


449349 


RTA00002690F.d.12.3.P.Seq


F 


M00042802C:C04 


CH16COP 


948


401124


RTA00002685F.o.11.2.P.Seq


F 


M00039629D:B04


CH12EDT


949 


453233 


RTA00002693F.a.01.2.P.Seq


F 


M00042611A:A06


CH19COP 


950 


124813 


RTA00002685F.j.10.2.P.Seq


F 


M00039564B:C01


CH12EDT 


951 


454627 


RTA00002693F.f.09.2.P.Seq 


F 


M00043210C:E05


CH19COP


952 


169464 


RTA00002663F.i.19.1.P.Seq


F 


M00022602A:E09 


CH03MAH 


953 


451654 


RTA00002692F.f.02.2.P.Seq 


F 


M00043044D:A09 


CH18CON


954 


406092 


RTA00002685F.k.11.2.P.Seq


F 


M00039584C:C11


CH12EDT


955


453501 


RTA00002693F.d.14.2.P.Seq


F 


M00043162D:C12


CH19COP


956 


450845 


RTA00002691F.f.10.1.P.Seq


F 


M00043410C:A09 


CH17COHLV


957 


443177 


RTA00002690F.e.12.1.P.Seq


F 


M00042839B:B11


CH16COP


958 


402617 


RTA00002686F.b.21.1.P.Seq


F 


M0004013!3:D1 1 


CH13EDT 


959 


378014 


RTA00002680F.g.17.1.P.Seq


F 


M00039799A:D10 


CH09LNL 


960 


124313 


RTA00002685F.j.10.1.P.Seq


F 


M00039564B:C01


CH12EDT 


961 


29450 


RTA00002663F.d.07.1.P.Seq


F 


M00022054A:H03 


CH03MAH 


962 


400486 


RTA00002685F.e.02.1.P.Seq


F 


M00039496B:D08


CH12EDT


963 


44753 


RTA00002713F.f.05.1.P.Seq


F 


M00027324D:C05 


CH04MAL 


964 


443177 


RTA00002690F.e.12.2.P.Seq


F 


M00042839B:B11


CH16COP


965 


447697 


RTA00002689F.e.15.3.P.Seq


F 


M00042905A:F11


CH15CON


966 


240318 


RTA00002687F.d.04.1.P.Seq


F 


M00039947A:D06 


CH14EDT 


967 


451620 


RTA00002691F.d.20.3.P.Seq


F 


M00043379D:H02 


CH17COHLV


968


400157 


RTA00002685F.i.20.2.P.Seq


F 


M00039561B:A09


CH12EDT


969 


400276 


RTA00002685F.h.16.2.P.Seq


F 


M00039523B:B12


CH12EDT 


970 


449779 


RTA00002691F.d.04.3.P.Seq


F 


M00043367B:A03


CH17COHLV 


971 


400157 


RTA00002685F.i.20.1.P.Seq


F 


M00039561B:A09


CH12EDT


972 


238133 


RTA00002685F.e.03.2.P.Seq 


F 


M00039496B:H09


CH12EDT 


973 


452015 


RTA00002692F.c.07.2.P.Seq


F 


M00042931B:D11


CH18CON


974 


400732 


RTA00002685F.l.01.2.P.Seq


F 


M00039537C.FI2 


CH12EDT 


975 


24984 


RTA00002711F.d.21. l.P.Seq 


F 


M00022910A:A06


CH03MAH 


976 


449040 


RTA00002690F.e.14.2.P.Seq


F 


M00042S4iD:H07 


CH16COP 


977 


377481 


RTA00002671F.i.15.3.P.Seq


F 


M0003S303A;CO3 


CH09LNL 


978 


400910 


RTA00002685F.b.07.1.P.Seq


F 


M00039367B:H02


CH12EDT


979 


376945 


RTA00002682F.k.23.1.P.Seq


F 


M00040007D:A06 


CH09LNL


980 


15906


RTA00002709F.e.14.1.P.Seq


F 


M00005305D:D12


CH02COH 


981


452781 


RTA00002692F.b.16.2.P.Seq


F 


M00042966B:F07


CH18CON


982 


415294 


RTA00002686F.f.14.1.P.Seq


F 


M00040173D:B05


CH13EDT


983 


401644


RTA00002685F.n.16.1.P.Seq


F 


M00039608D:H01


CH12EDT 


984 


404402 


RTA00002687F.a.19.2.P.Seq


F 


M00039761D:E10


CH14EDT


985 


401709 


RTA00002685F.n.24.2.P.Seq 


F 


M00039624A:H09 


CH12EDT


986 


401644 


RTA00002685F.n.16.2.P.Seq


F 


M00039608D:H01


CH12EDT


987


452531 


RTA00002692F.f.16.2.P.Seq


F


M00043125A:B11


CH18CON



Voo 


.1 AAA 1 A 

4UUV 1 U 


RT A 0000^68 *F b 07 1 PSea 


F 


M00039367B:H02 


CH12EDT 


yor 




RTaOOO0' ? 69OF 3 P Sea 


F 


M00042439B:B03 


CH16COP


QQA 


4*47/7*+ 


R TA 0000^69 1 F c ^ 1 P Sea 


F 


M00043361B.A01 


CH17COHLV


QO 1 
77 1 


4UUVi 1 


RTAOOOO^nSSF b 18 1 P Sea 


F 


M00039371B:H06 


CH12EDT


QQ7 

77 — 


77t B7*l 
J / Jo /4 


RTAflf!00' , ^7^F c n i P Sea 


F 


M00038663D:H10


CH09LNL 




iifs i a^a 

4U 1 U JU 


RTa 0000^685 F e 09 P Sea 


F 


M00039499C:A04 


CH12EDT 


774 


A ^ 1717 


RTAf)non" , 69"iF c 0^ "* P Sea 


F 


M00043108A:F06 


CH19COP 


77 J 


447—74 


RTAfinon^^QOF c 13 3 P Sea 


F 


M00042770C:C04 


CH16COP 


770 


4U4iOU 


RTAnftftfPfiR7F c 1 1 1 P Sea 


F 


M0003994"'D*C0l 


CH14EDT 


OO" 

77 / 


J / oU 1 4 


rt a nnnn"* AXftF a 17 ^ P Sea 


F 


M00039799A:D10


CH09LNL 




,1 A,l *7t£ 

4U4 flO 


pta nnnn^ARSF a 18 ^ P Sea 


F 


M0004037' C H05 


CH14EDT 


QOO 


,t c 1 "* 1*7 

43 I J4 / 


RTAfinnfl">^Q t F h 1 1 I P Sea 


F 


M00043311C:E03


CH17COHLV 


i aaa 


/I A 1 1 C 1 

4U 1 IM 


dt a nnnn^AXsP > oft ^ P Sen 


F 


M00039497C:C06


CH12EDT 


1001 


401S/0 


r>TA AAAA^AQAF U *)7 I P ^i»n 
K 1 AUUUU_OoOr . I -r. JtL( 


c 

I 


M00040131C:F03


CH13EDT 


1002


400 l 70 


OTA nArtn^AR^F h m ^ P *s»»n 


F 

r 


M00039366C:B07


CH12EDT 


100 j 


25j87 


Ota AAAf\"> 7 1 1 F f 1 0 1 P ^i*o 


F 


MOOO^jOO 1 ccos 


CH0 >M AH 


1004 


j7 /08:> 


O T A AAAA" 1 A79F n 1J 1 P ^t»n 


F 
r 


M000396 19BD0"* 


CH09LNL 


100^ 


40jDj0 


K 1 AUUUU-Ooor .a.U7._.r.oeq 


F 

r 


VI 00040" 68 A F0 1 


CH14EDT 


1006 


->729j0 


DT \ AAAA1A7AF i 1 7 P 

K I AUUUU-0 /Ur J. 1 r.oeq 


F 

r 


iVIO0OVi4" r C' A07 


CH09LNL 


1007 


401 120 


R 1 AUUUU-OO- r -C-J.--r.oeq 


F 

r 


Mftft0"?9 j 7 9 A BO 3 


CHPEDT 


1008 


40j397 


DT \ AAAA"*A^7C h A7 7 P Q^n 

R I AOUUU-Oo / r .n.U-.J.r.oeq 


r 


iV100040^ 1 9BD0 - ' 


CH14EDT 


1 A Aft 

1009 


449 j j7 


dt \ AAAA"*AQAF /• IS < P 


F 
* 


MOO04* , 7*"-ir-C0^ 


CH16COP


1010 


40jd61 


DT VAAAA"*<^^C f< AA 7 p Qjfi 


F 

r 


M00040jS~CE07 


CH14EDT 


101 1 


1 _>4 1 82 


D T A AAAA* , AQ7C A 1 "X n P *Sr*A 

R 1 AUUUU-OV^r .a. i J.-.r. jcq 


F 
r 


M000430 1 ! A H P 


CHI SCON 


1012 


j770oj 


D T \ AAAA" > A79C n 1 J 7 p 


F 


M000396 1 9B'D0"* 


CH09LNL 


101 j 


j761 j8 


D T \ AAAA^ATJF m AS I P ^i»n 


F 
i 


M00039I6^ A £ P 


CH09LNL 


1014 


401 1 34 


R 1 AUUUU— Oo Jr c.UD. I .r.^cq 


F 
i 


M000394°"C C06 


CHPEDT 


101 3 


44vg Jo 


dt a AAAA^/iO 1 F h IJ 1 P Sea 
K 1 AUUUU _07 1 r . V. l - . r. jcq 


F 
i 


M000433"0B' A07 


CH17COHLV 


1016 


40jS96 


DT \ AAAA7AQ7F t Oil 7 p C-*n 
R t AUUUU-Oo / r.a.ut — r. ,jcq 


F 

r 


M00039746CH05 


CH14EDT


inn 

1017 


j tlbo2 


OTA AAAA^^Rl F I IS "* P Sen 


F 


M000400S - DF08 


CH09LNL 


1 0 1 o 


4D0843 


DT A AAAA^AO 1 F f 1 ft "* P ^»»n 


F 

r 


M0004^4 HXT- A09 


CH17COHLV 


1019 


4^0043 


dt A AAAA^AO IP f> 1ft ** P <!en 
K. 1 AvUUU—O" I r .c. i v — r. jcq 


F 
* 


M00043 >^ • A C 10 


CH17COHLV 


1 AO A 




OTA onnn"*nS6F d ^ 1 PSea 

JV l rtUvvv-UOUr ,U.— — . i -t - Jtt| 


F 

■ 


M00040147D:H1 1 


CHPEDT 


102 1 


427674 


DT a AAftfPfifiSF i lfl 1 P Sea 


F 

i 


M000"'877<DF03 


CH0SLNH 


l A17 




OTA AOAO^fiKSF r ] 5 "» P Sea 


F 


M00040383D:C04 


CH14EDT


1 AT "* 

102j 


A C1A'' O 


dt a AnftfPAQ^F a DQ 1 P Sen 


F 

J 


M0004" , 6" , " ; DD07 


CHI SCON 


t AT t 


401 !>D.> 


DT AOAAO"*fiS^F d OS "* P Sea 


F 

* 


M000"948^3G02 


CH12EDT 


1025 


451092


RTA00002691F.d.17.3.P.Seq


F 


M0004337'A:C03 


CH17COHLV 


1026 


403978 


RTA00002687F.g.09.2.P.Seq


F 


M0004020SB:A07 


CH14EDT


1027 


377186


RTA00002682F.m.07.1.P.Seq


F 


M000400UD:F03 


CH09LNL 


1028


404679 


RTA00002687F.f.07.2.P.Seq


F 


M00040203A:H06


CH14EDT


1029 


373875 


RTA00002674F.c.05.1.P.Seq


F 


M00039124C:H02 


CH09LNL 


1030 


128841 


RTA00002685F.o.15.2.P.Seq


F 


M00039630C:H04 


CH12EDT


1031 


33971 


RTA00002713F.h.13.1.P.Seq


F 


M000273«23:H02 


CH04MAL 


1032 


332878 


RTA00002666F.h.13.1.P.Seq


F 


M00032^*C:B0I 


CH08LNH


1033 


400781 


RTA00002685F.j.03.2.P.Seq


F 


M00039562B:G02


CH12EDT


1034 


456456 


RTA00002694F.b.22.1.P.Seq


F 


M0004344O,A:£l2 


CH20COHLV 



1035


402337 


RTA00002686F.l.07.1.P.Seq


F 


M00040257D:H10 


CH13EDT 


1036


401974 


RTA00002686F.i.15.1.P.Seq


F 


M00040223A:C05 


CH13EDT 


1037


455141 


RTA00002694F.b.14.1.P.Seq


F 


M00043440C:B07


CH20COHLV 


1038 


402057 


RTA00002686F.l.14.1.P.Seq


F 


M00040260C:D04


CH13EDT 


1039 


402555 


RTA00002686F.m.14.1.P.Seq


F 


M00040267C:C04 


CH13EDT 


1040


406092 


RTA00002685F.k.11.1.P.Seq


F 


M00039584C:C11


CH12EDT 


1041


374351 


RTA00002674F.i.20.1.P.Seq


F 


M00039147A:F10 


CH09LNL 


1042


40^365 


RTA00002686F.j.08.1.P.Seq


F 


M00040230A:H02 


CH13EDT 


1043


4018^8 


RTA00002686F.j.14.1.P.Seq


F 


M00040232D:B07 


CH13EDT 


1044


447669 


RTA00002689F.a.15.2.P.Seq


F 


M000425JiB:E06 


CH15CON 


1045


407588 


RTA00002686F.k.18.1.P.Seq


F 


M00040254B:C10 


CH13EDT 


1046




RTA00002686F.l.02.1.P.Seq


F 


M00040256A:A06


CH13EDT


1047 


4071 *>9 


RTA00002686F.i.20.1.P.Seq


F 


M00040226A:H10 


CH13EDT 


1048 


401766


RTA00002686F.o.16.1.P.Seq


F 


M00040282A:A03


CH13EDT 


1049 


407QO 


RTA00002686F.g.14.1.P.Seq


F 


M00040181D:H10 


CH13EDT 


1050 




RTA00002690F.c.10.3.P.Seq


F 


M00042767B:G10 


CH16COP


1051


400 s^n 


RTA00002685F.g.04.2.P.Seq


F 


M00039512C:D06


CH12EDT 


1052


401^6? 


RTA00002687F.k.05.1.P.Seq


F 


M00040318C:H11


CH14EDT 


1053


il017,t7 


RTA00002687F.l.05.1.P.Seq


F 


M00040323B:C12


CH14EDT 


1054


J07 1 $7 


RTA00002686F.f.16.1.P.Seq


F 


M00040174C:E10


CH13EDT


1055


4*10760 


RTA00002690F.c.12.3.P.Seq


F 


M00042770B:B12 


CH16COP 


1056


JO 1 700 


RTA00002685F.n.10.1.P.Seq


F 


M00039606B:D08 


CH12EDT 






RTA00002690F.d.07.3.P.Seq 


F 


M00042790C:C07


CH16COP 


1058




RTA00002674F.i.20.2.P.Seq


F 


M00039147A:F10


CH09LNL 


1059 


44X464 


RTA00002690F.c.08.3.P.Seq 


F 


M00042765C:D04 


CH16COP


1060 


40 1 07Q 


RTA00002685F.p.05.2.P.Seq


F 


M00039643C:B04


CH12EDT 


1061 


4010 1 6 


RTA00002687F.j.11.1.P.Seq


F 


M00040314D:H05 


CH14EDT 


1062


40 1 174 


RTA00002685F.p.07.2.P.Seq


F 


M00039645C:E01


CH12EDT 


1063 


400501 


RTA00002685F.k.02.1.P.Seq


F 


M00039570B:D10 


CH12EDT


1064 


7 1Q8'' 5 


RTA00002664F.h.06.:.P.Seq 


F 


M000273%D:G08 


CH04MAL 


1065 


37773^ 


RTA00002631F.p.09.:. P.Seq 


F 


M00039910C:G10 


CH09LNL 


1066 


380348 


RTA00002684F.d.12.1.P.Seq


F 


M00040121B:C05


CH09LNL 


1067 


449549 


RTA00002690F.a.09.3.P.Seq


F 


M0004243ICFOI 


CH16COP 


1068 


4077 71 


RTA00002686F.f.05.1.P.Seq


F 


M00040169B:F08 


CH13EDT 


1069 


401777 


RTA00002685F.o.23.2.P.Seq


F 


M00039642D:H09


CH12EDT 


1070 


379878 


RTA00002682F.h.12.1.P.Seq


F 


M00039934A:C02 


CH09LNL 


1071 


378602 


RTA00002681F.a.08.2.P.Seq


F 


M00039839C:E05


CH09LNL


1072 


448065 


RTA00002690F.c.22.3.P.Seq


F 


M00042781A:A07


CH16COP 


1073 


403493 


RTA00002687F.j.03.1.P.Seq


F 


M00040313D:E04 


CH14EDT 


1074 


400517


RTA00002687F.k.15.1.P.Seq


F 


M00040320D:F02 


CH14EDT 


1075 


456636 


RTA00002694F.e.05.1.P.Seq


F 


M00043632D:F09 


CH20COHLV 


1076 


400101 


RTA00002685F.o.04.1.P.Seq


F 


M00039625B:G08


CH12EDT


1077 


403578 


RTA00002687F.i.01.1.P.Seq


F 


M00040296D:E09 


CH14EDT


1078 


402419


RTA00002686F.g.20.1.P.Seq


F 


M000401S4O.A1 1 


CH13EDT 


1079 


375161 


RTA00002676F.n.01.2.P.Seq


F 


M00039319B:H12


CH09LNL


1080 


401851 


RTA00002636F.d.O:.I.P.Seq 


F 


M00040143A:H05


CH13EDT 


1081 


400567 


RTA00002685F.a.14.2.P.Seq


F 


M00039361B:E01


CH12EDT 


1082 




D t \ i^n/\fi* > F H 0 1 "> P ^t*n 
K I AU</'.J , /_0 ' ' r .Q-l/ 1 .—.r. jcq 


F 
i 


M000 1 9 14 * A D09 


CH09LNL


i no' 1 

Iuoj 


j76641 


K 1 AUUU'J-0 / / r .c.--*.-.r. jcq 


c 

r 


iVIfiOft^Qia- A D09 


CH09LNL 


t a o -i 

1084 


i a a .# e a 

400450 


dt \ rifiA/v^z o< t i ">"/ 1 P ^»*n 
K 1 AUU*JU_OttJr.J.--- 1 .r.ocq 


r 




CH PFDT 


t f\ n r 

1085 


^ C ** *T ~* 


K 1 ALMJUU-O / Or .n. 1 _. I .r.oeq 


c 

r 




ruoor NI 


1086 


^7Dj7j 


dt \ A.AnA*>A7AP W 1 "? 'J P ^^*n 
K I AUUUU-0 /Or n. 1 r .ocq 


F 
r 


M ooo 1 q " n o c • ro9 


CH09LNL 


1087 


41 _>64.> 


D*T* \nAAm£B?C n On ~* P ^*>n 


r 
r 


!VIOftftlQftO-l 0003 


CHPFDT 


1 AO o 

1088 


448874 


dt \ r\f\rkf\l £.Q(W r- O"' 1 P Q^n 
K I AUOUUJoVUr .c.U — >,r.:>eq 


r 
r 


iVioon4">7^cR n 1 1 


CHI6COP 


1089 




DT \ AAAA"! £7,1 F k 04. 1 P <\.*a 


F 
r 


M0001Q 1 4ft A • R0S 


CH09LNL 


1090 


j 74040 


R 1 AUOUUio Mr .n— i . i .r.oeq 


r 


iVIOOO^QM^DRl 1 


CH09LNL 


■ Art 1 

109 1 


4:>4 1 j2 


RTAUUUUioyjr.e. 1 o. i .roeq 


r 


N/100041 t Q • A A 07 


CH19COP 


1092 


404^81 


RT AOOOO-Oo / r .2. 1 l - 1 . r .:>eq 


c 
r 


Mooo4ftT0^ rvnoQ 


CH14EDT 


1093 


260521 


RTA0000265tt.c. 1 j. 1 .r.:>eq 


c 
r 


\/!0004" > 70* Q C.fil 




1094 


379564 


RTA0000268/r 0. 1 I .r.beq 


c 
r 


M00040^4A ATI 1 


ru 14FDT 


1095 


452491 


RTA00002692 r .r .UD .r.oeq 


c 
r 


V,IAAn4^04A P»* R t 1 




.1096 


403541 


RTA0000268 / F.p.2U.2.P.beq 


r 
r 


.VIAAftJfllA * A • FOS 


THI4FDT 


1097 


404636 


RTA00002688F -b. 1 1 .J.P.beq 


c 
r 


IV1UUU4U J / OK. .VJU- 


CU \ 4FHT 


1098 


379564 


RTA0000268 / F.o. 1 2.2.P.beq 


r 


MAA/^ lAT-lAA-n 1 
ivIUUlWU-MC A-C I 1 




1099 


45 1 548 


RTA00002691 F.b.0 g .j-P.beq 


c 
r 


ivlUUU-* j j 1 1, i_ .OUO 




1 100 


454308 


RTA0000269_>F.t.t4.l .P.Seq 


c 
r 




r u 1 ornp 


1 101 


401 184 


RTA00002685F.a.04.j.P.btfq 


c 
r 


\ lAnAioi f~-^*AO 




1 102 


401290 


RTA0000268D F.n. 1 0.2.P.5eq 


r 


ivlUUUjVoUC d.UUo 


rune nT 
v,n i-ci/ 1 


1 103 


400101 


RTA 00002685 F.o. 04. 2. P.beq 


r- 

r 


\ 1AAA"!Q£1 < >3 *f^A ^ 


run FDT 


1 104 


454308 


RTA0000269j FM 4.2.P.beq 


c 
r 


N.IAAA 1 1~> 1 '.DRP 
IVlUUU4j_ l.'D.Dl. 




1105 


452622 


RTA00002692F.D.14. l .p.beq 


c 
r 


\ IAAA l"*OA "* Pl-PAS 
tvlUUU4_V0_ t-/ V-U- 1 




1 106 


450012 


RTA00002691 F.d.Ov. j.P.beq 


c 
r 




ru 1 7COHLV 


1 107 


400503 


RTA00002683F.k.02.2.P.beq 


r 


x i aaa io ^*"r ^ n \ o 


ru PFDT 


1 108 


400450 


RTA000026S? F.j.J -.J.P.beq 


r 


VIAAAIO^ Tt A • n 1 0 




1 109 


446 1 66 


RTA0OOO-689r.c. 1 '.I.P.beq 


c 
r 


V,IAAA_1^7 1 A 1 I 




1 1 10 


456233 


AT" i AAAAO^A.I C » A O 1 Q C Jn 

RTA000O_694r e.Uo. \ .r.b^q 


c 

r 


VI AAA41A 1r> ^ Tflfi 


CFP0COHLV 


1 1 1 1 


25443 


f-k -r- t rtAA ATT 1 A C -4 1 "N 1 A Cjn 

RTAOOOO-VIOr.a. 1^. I.P.oeq 


c 
r 


VIAOO'* 1 SA^ A 01 




1112 


4041 19 


RTAOOOO-ooor.G. 1 .J.P.oeq 


c 
r 


VI0AA4010TR 1 


ru 14FDT 


1113 


403642 


AT*. AAAA1 ZIOTC J A 1 1 DCjrt 

RTAOOOO-00 /r .u.U l . 1 .r.xq 


p 

r 


^ ^ 10oo*^004• ; l~■• F09 


CH 14EDT 


1 I 14 


40j49j 


RTAOOOO-Oo /r. j. U-v^.r.^eq 


p 

r 


VI 000401 1 "DF04 


CH14EDT 


1115 


454132 


RTAOOOOZovjr .e. 1 o.,. r.beq 


. p 
r 


\,lO0O4l 1 Q I A ■ A 07 


CH 19COP 


1 1 16 


4^0607 


AT a AAAA^^O 1 C A 1 ^ ID 

R I A0UUO_oy 1 r.u. I j.r.^eq 


p 

r 


vioooii i7^r'nos 


CH 17COHLV 


kit 

1117 


451718 


RTAOOOO-ovZr e.24._.K -^eq 


p 

r 


.VI00041044 R* A 1 


CHI SCON 


1118


43^907 


RTA0000-693F.b.0S.2.P.Seq 


F 


M0004lOS~3 GO 7 


CH19COP 


1119 


447669


RTA00002689F.a.15.3.P.Seq


F 


N/10004"* S 1 *\R F06 


CH15CON


1120




RTA00002687F.p.11.1.P.Seq


F 


M0004035iD:AM 


CH14EDT 


1121


449617 


RTA00002690F.e.16.2.P.Seq


F 


M00042S4^D:F1 1 


CH16COP 


1122


452723 


RTA00002692F.e.13.2.P.Seq


F 


M00043036C:E05 


CH18CON


1123 


270014 


RTA00002685F.i.15.2.P.Seq


F 


M0003^536C:H1 1 


CH12EDT


1124 


401193


RTA00002685F.i.14.2.P.Seq


F 


M0003 £ >53tC:C10 


CH12EDT


1125


452414 


RTA00002692F.e.12.1.P.Seq


F 


M00043032C:A10


CH18CON


1126 


453019 


RTA00002692F.d.lS.:.P,Seq 1 F 


M00043015A:H10


CH18CON


1127 


403642 


RTA00002687F.C.24.1.P.Seq


F


M0003994;C:F09 


CH14EDT


1128


401437 


RTA00002685F.C.18.2.P.Seq


F


M00039377D:E12


CH12EDT 
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SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1129


452414 


RTA00002692F.e.12.2.P.Seq


F 


M00043032C:A10


CH18CON


1130


404122 


RTA00002687F.n.10.1.P.Seq


F 


M00040334D:B02 


CH14EDT


1131


400567 


RTA00002635RX 1 4. 1 .P.Seq 


F 


M00039361B:E01


CH12EDT


1132 


401437 


RTA00002685F.C.18.1.P.Seq


F 


M00039377D:E12 


CH12EDT 




404642 


RTA00002687FT.02. 1. P.Seq 


F 


M0004020iC:GU 


CH14EDT 


1134 


376007 


RTA00002676F.f.22.2.P.Seq


F 


M00039293B:C11


CH09LNL 


1135 


402835 


RTA00002686F.b.24.1.P.Seq


F 


M00040131D:G08


CH13EDT 


1136 


403774 


RTA00002687F.d.08.1.P.Seq


F 


M00039947C:G03 


CH14EDT 


1137


45505 


RTA00002712F.d.04.1.P.Seq


F 


M00023377B:F01 


CH04MAL 


1138 


452071 


RTA00002692F.c.05.2.P.Seq 


F 


M00042979B:E02 


CH18CON


1139 


449832 


RTA00002691F.e.13.1.P.Seq


F 


M00043393A:B08 


CH17COHLV 


1140 


379004 


RTA00002683F.n.09.2.P.Seq 


F 


M00040093B:C02 


CH09LNL 


1141


455211 


RTA00002694F.b.07.1.P.Seq


F 


M00043430B:C02 


CH20COHLV 


1142 


379021 


RTA00002683F.n.13.2.P.Seq


F 


M00040093D:D03 


CH09LNL 


1143 


376279 


RTA00002680F.d.10.2.P.Seq


F 


M00039785D:G05


CH09LNL 


1144


374373 


RTA00002681F.n.21.1.P.Seq


F 


M00039903A:H07 


CH09LNL 


1145


97668 


RTA00002686F.d.19.1.P.Seq


F 


M00040145D:D03 


CH13EDT


1146 


400407 


RTA00002685F.a.05.2.P.Seq




M00039tS4A:D03 


CH12EDT


1147 


402904


RTA00002686F.n.15.1.P.Seq


F 


M00040274A:H11


CH13EDT


1148


403912 


RTA00002687F.j.19.1.P.Seq


F 


M00040317A:H03 


CH14EDT 


1149 


400511


RTA00002685F.b.23.2.P.Seq


F 


M00039372C:D12


CH12EDT


1150 


402746 


RTA00002686F.a.14.1.P.Seq


F 


M00039740B:F10


CH13EDT 


1151


403849 


RTA00002687F.n.09.2.P.Seq 


F 


M00040333D:G05 


CH14EDT 


1152 


401471 


RTA00002685F.o.10.1.P.Seq


F 


M00039629B:F01


CH12EDT 


1153 


404362 


RTA00002687F.o.06.2.P.Seq


F 


M00040342B:D12


CH14EDT 


1154 


373641 


RTA00002677F.L09.2.P.Seq 


F 


M00039403A:G12


CH09LNL 


1155


401952 


RTA00002686F.J.10.1.P.Seq


F 


M00040231B:C08 


CH13EDT 


1156 


400685 


RTA00002685F.m.09.2.P.Seq


F 


M0003^5^7D:FO4 


CH12EDT 


1157 


402689 


RTA00002686F.n.05.1.P.Seq




M00040271B:E12


CH13EDT 


1158 


380462 


RTA00002670F.O.01.2.P.Seq


F 


M000335TOB:E06 


CH09LNL 


1159


400073 


RTA00002685F.m.15.2.P.Seq


F 


M00039600A:A11


CH12EDT 


1160 


373748 


RTA 00002 67 IF. 1.06. 3. P.Seq 


F 


M00038325D:F12


CH09LNL 


1161 


401392 


RTA00002685F.f.08.2.P.Seq


F 


M00039505C:E03


CH12EDT


1162 


20548 


RTA00002710F.h.15.1.P.Seq


F 


M0002224~A:E02 


CH03MAH 


1163 


376279 


RTA00002680F.d.10.1.P.Seq


F 


M00039785D:G05 


CH09LNL 


1164


374428 


RTA00002672F.a.20.1.P.Seq


F 


M00038633B:G02 


CH09LNL 


1165 


374428 


RTA00002672F.a.20.2.P.Seq


F 


M00038633B:G02


CH09LNL 


1166 


372914 


RTA00002679F.J.21.1.P.Seq


F 


M00039696A:E05


CH09LNL 


1167 


378320 


RTA00002681F.L14.2.P.Seq 


F 


M00039894C:H07 


CH09LNL 


1168


235422 


RTA00002665F.H.19.1.P.Seq




M0002S?6SC:D05 


CH08LNH


1169 


402473 


RTA00002686F.p.11.1.P.Seq




M000402S~C:B09 


CH13EDT 


1170 


374328 


RTA00002674F.m.10.1.P.Seq




M00039170A:B10


CH09LNL 


1171 


403912 


RTA00002687F.j.19.2.P.Seq




M00040317A:H03


CH14EDT 


1172 


401471 


RTA00002685F.o.10.2.P.Seq




M0003962OB.F01 


CH12EDT 


1173 


404362 


RTA00002687F.O.06.1.P.Seq




M00040342B:D12 


CH14EDT 


1174 


403849 


RTA00002687F.n.09.1.P.Seq




M00040333D:G05


CH14EDT 


1175


395617 


RTA00002687F.b.15.1.P.Seq


F 


M0003976~B:A04 


CH14EDT
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SEQ 
ID


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1176


401709 


RTA00002685F.o.01.2.P.Seq


F 


M00039624A:H09


CH12EDT


1177 


404464 


RTA00002687F.O.22.1.P.Seq


F 


M00040347D:F09 


CH14EDT 


1178


447795 


RTA00002689F.e.06.3.P.Seq 


F 


M00042895C:G01


CH15CON 


1179


18139 


RTAOO002708FT. 10. 1. P.Seq 


F 


M00004139B:B10 


CH01COH


1180


403893 


RTA00002687F.a.05.1.P.Seq


F 


M00039746C:H06


CH14EDT 


1181


453512 


RTA00002693F.a.21.2.P.Seq


F 


M00043078D:D04 


CH19COP 


1182 


404172 


RTA00002687F.d.17.1.P.Seq


F 


M00039951B:B12 


CH14EDT 


1183


400973 


RTA00002685F.c.06.2.P.Seq 


F 


M00039374C:H12


CH12EDT


1184 


450198 


RTA00002691F.e.23.2.P.Seq


F 


M00043405A:D11


CH17COHLV 


1185


451502 


RTA00002691F.f.03.2.P.Seq


F 


M00043406B:G12 


CH17COHLV 


1186


454414 


RTA00002693F.t*.l8.2.P.Seq 


F 


M00043220B:C04 


CH19COP 


1187


453752 


RTA00002693F.b.02.2.P.Seq


F 


M00043031D:F05 


CH19COP 


1188


403700 


RTA00002687F.g.03.1.P.Seq


F 


M00040207B:D08 


CH14EDT 


1189 


403371 


RTA00002687F.h.19.1.P.Seq


F 


M00040294D:D12 


CH14EDT 


1190 


14583 


RTA00002687FJ.08. LP.Seq 


F 


M00040203B:A05 


CH14EDT 


1191 


404161 


RTA00002687F.e.20.1.P.Seq


F 


M00039958C:B09 


CH14EDT 


1192 


403274 


RTA00002687F.b.10.1.P.Seq


F 


M00039766A:G07


CH14EDT 


1193 


373465 


RTA00002671F.o.09.1.P.Seq


F 


M00033615A:H12


CH09LNL 


1194


402582 


RTA00002686F.m.08.1.P.Seq


F 


M00040265D:C08


CH13EDT 


1195 


402241


RTA000026S6F. 1.16. 1. P.Seq 


F 


M00040261C:F01 


CH13EDT 


1196


380451 


RTA00002670F.p.12.1.P.Seq


F 


M00033581D:D08 


CH09LNL 


1197


455938 


RTA00002694F.d.24.1.P.Seq


F 


M00043528C:A02 


CH20COHLV 


1198


374297 


RTA00002672F.i.02.2.P.Seq 


F 


M00039013D:F02 


CH09LNL 


1199


402624 


RTA00002686F.p.13.1.P.Seq


F 


M00040237D:D07 


CH13EDT 


1200 


402322 


RTA00002686F.J.16.1.P.Seq


F 


M00040233A:H02 


CH13EDT 


1201 


449504 


RTA00002690F.C.11.2.P.Seq


F 


M00042769C:E09


CH16COP 


1202 


226704 


RTA00002664F.a.11.1.P.Seq


F 


M00023352D:H03


CH04MAL 


1203 


271092 


RTA00002690F.b.23.2.P.Seq 


F 


M00042756D:A10 


CH16COP 


1204 


400864 


RTA00002685F.g.17.2.P.Seq


F 


M00039517B:G12


CH12EDT 


1205 


235855 


RTA0000266TF.O.06. 1 .P.Seq 


F 


M00032S76C:DO6 


CH08LNH 


1206 


402789 


RTA00002686F.g.16.1.P.Seq


F 


M00040183A:F07 


CH13EDT 


1207 


19826 


RTA00002710F.k.05.1.P.Seq


F 


M00022467C:B12


CH03MAH 


1208 


380157 


RTA00002682F.h.19.1.P.Seq


F 


M00039934D:G12 


CH09LNL 


1209 


401187


RTA00002685F.e.15.2.P.Seq


F 


M00039500C:C04 


CH12EDT


1210 


427346 


RTA00002665F.b.01.3.P.Seq 


F 


M00028066C:D07 


CH08LNH 


1211


402866 


RTA00002686F.C.15.1.P.Seq


F 


M00040I3SB:H03 


CH13EDT 


1212 


376712 


RTA00002677F.c.13.2.P.Seq


F 


M00039343B:F12 


CH09LNL 


1213 


401655 


RTA00002685F.C.22.1.P.Seq


F 


M00039373D:H0" 


CH12EDT 


1214 


400147 


RTA00002685F.g.10.1.P.Seq


F 


M00039515A:A06 


CH12EDT


1215 


400864 


RTA00002685F.g.17.1.P.Seq


F 


M00039517B:G12 


CH12EDT 


1216 


451600 


RTA00002691F.b.19.3.P.Seq


F 


M00043328D:H02 


CH17COHLV


1217 


400147 


RTA00002685F.g.10.2.P.Seq


F 


M00039515A:A06 


CH12EDT


1218 


401655 


RTA00002685F.c.22.2.P.Seq


F 


M0003937SD:H07 


CH12EDT 


1219 


449307 


RTA00002690F.a.10.3.P.Seq


F 


M00042431D:C10 


CH16COP 


1220 


403121 


RTA000026S3F. 3.01. 2. P.Seq 


F 


M00040366A:B0! 


CH14EDT 


1221 


451718 


RTA00002692F.e.24.1.P.Seq


F 


M00045044B:A12 


CH18CON




294345 


RTA00002635F.ii. 14. 1 .P.Seq 


F 


M00039515D:C11


CH12EDT 
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CH 19COP 


1242 


447671 
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CH 13CON 


124 j 
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KTA0000269jF-b. 14.2.r.beq 
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124/ 
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403397 


RTA00002687F.h.02.1.P.Seq 
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CHI4EDT 
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RTA00002636F.b.O5 1 .P.Seq 
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451379 


RTA00002691F.b.12.2.P.Seq
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456624 


RTA00002694F.e.02.1.P.Seq
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SEQ 
ID 


CLUSTER 


SEQ NAME


ORIENTATION 


CLONE ID


LIBRARY 


1270 


377039 


RTA00002686F.O.12.1.P.Seq


F 


M000402S0C.H05 


CH13EDT


1271 


18041 


RTA00002710F.h.21.1.P.Seq


F 


M00022262D:G03


CH03MAH 


1272 


401381 


RTA00002685F.O.08.1.P.Seq


F 


M00039626D:F04 


CH12EDT 


1273 


428491 


RTA00002666F.C.05.1.P.Seq


F 


M000325?5D:HOI 


CH08LNH 


1274 


54656 


RTA00002661F.i.22.2.P.Seq


F 


M000043":B:F07 


CH01COH 


1275 


379183 


RTA00002679F.U 7.1. P.Seq 


F 


M00039688C:G06


CH09LNL 


1276 


25594 


RTA00002711F.f.07.1.P.Seq


F 


M00022968B:E02 


CH03MAH 


1277 


403355 


RTA00002687F.d.11.1.P.Seq


F 


M00039948D:D11 


CH14EDT 


1278 


16789 


RTA00002709F.b.09.2.P.Seq


F 


M00005382B:F08 


CH02COH 


1279 


23292 


RTA00002708F.C.02.1.P.Seq


F 


M00003750D:E06 


CH01COH 


1280 


373982 


RTA00002673F.b.24.2.P.Seq 


F 


M00039058A:A04


CH09LNL 


1281 


373982 


RTA00002673F.C.01.2.P.Seq


F 


M00039058A:A04 


CH09LNL 


1282 


449911


RTA00002691F.e.02.2.P.Seq


F 


M0004338-iB:B02 


CH17COHLV


1283 


450633 


RTA00002691F.f.02.2.P.Seq


F 


M00043405C:G12


CH17COHLV


1284


23939 


RTA00002713F.;. 14.1. P.Seq F 


M000274S6A;F06 


CH04MAL 


1285


450633 


RTA00002691F.f.02.1.P.Seq


F


M00043405C:G12


CH17COHLV


1286 


379122 


RTA00002672F.n.14.1.P.Seq


F


M0003903^3:F09 


CH09LNL 


1287 


449429 


RTA00002690F.a.16.3.P.Seq


F


M00042437A:D04


CH16COP


1288 


430578 


RTA0000266SF.g. 18. 1 .P.Seq j F 


M0003298-:C:G05 


CH08LNH 


1289 


425824 


RTA00002687F.b.17.1.P.Seq


F


M00039767C:E12


CH14EDT 


1290 


425824 


RTA00002687F.b.17.2.P.Seq


F


M00039767C:E12


CH14EDT 


1291 


401266 


RTA00002685F.L 1 1 .*>.P.Sea j F 


M00039535D:D10


CH12EDT


1292 


377949 


RTA00002674F.p.04.1.P.Seq


F


M00039200A:C10


CH09LNL 


1293 


12926 


RTA00002710F.e.21.1.P.Seq


F 


M00022005C:C06


CH03MAH 


1294 


378242 


RTA00002679F.c.20.2.P.Seq 


F 


M0003966-iD:G07 


CH09LNL 


1295 


401781 


RTA00002686F.e.0S. 1 .P.Seq 


F 


M00040160B:A10 


CH13EDT 


1296 


453101 


RTA00002693F.c.16.2.P.Seq


F 


M000431-:3:A10 


CH19COP 


1297 


377592 


RTA00002677F.U22.PSeq | F 


M000394i5D:£0I 


CH09LNL 


1298 


404340 


RTA00002687F.b.05.1.P.Seq


F


M0003976-C.D07 


CH14EDT


1299 


400968 


RTA00002685F.h.01.2.P.Seq


F


M0003 C ?5::D:H03 


CH12EDT 


1300 


400968 


RTA00002685F.g.24.2.P.Seq


F


M000395:;D:H03 


CH12EDT 


1301 


374417 


RTA00002671F.j.15.3.P.Seq


F


M0003S3i5C:Gl 1 


CH09LNL 


1302 


374621 


RTA00002675F.p.02.1.P.Seq


F 


M00039263D:A12


CH09LNL 


1303 


19063 


RTA00002708F.i.14.1.P.Seq


F 


M0000436! A:H02 


CH01COH


1304 


135941


RTA00002713F.g.06.1.P.Seq


F


M0002735^S:G05 


CH04MAL 


1305 


403355 


RTA00002687F.d.11.2.P.Seq


F


M00039948D:D11


CH14EDT


1306 


375226 


RTA00002677F.m.03.2.P.Seq


F


M000394i"C.A01 


CH09LNL 


1307 


222658 


RTA00002664F.e.14.2.P.Seq


F


MOOO:7103 3:A09 


CH04MAL 


1308 


447978 


RTA000026QOF.d. I 1 .3. P.Seq | F 


M00042800A:A03


CH16COP


1309 


431346 


RTA00002669F.g.24.1.P.Seq


F 


M000332iSA:C04 


CH08LNH 


1310 


455579 


RTA00002694F.a.10.1.P.Seq


F 


M000425^cB:F06 


CH20COHLV 


1311


13406 


RTA00002709F.1. 14.1. P.Seq 


F 


M0000712-D-H10 


CH02COH 


1312 


378364 


RTA00002674F.o.17.1.P.Seq


F


M0005^l^cD A07 


CH09LNL 


1313 


373788 


RTA00002671F.c.16.2.P.Seq


F


M0003S:5-A:G08 


CH09LNL


1314 


403548 


RTA0000:683F.a.i0.:.P.Seq | F 


M000403oiO:E09 


CH14EDT


1315 


22425 


RTA00002T09F.C. 08.2. P.Seq | F 


M000054^S.A:H06 


CH02COH 


1316 


452238 


kTA000026Q2F.c.2 1 Z.P.Sec F 


M00042 ll °SA.G04 


CH18CON
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SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1317 


446680 


RTA00002689F.C.04.1.P.Seq


F 


M00042693D:E04


CH15CON


1318 


142922 


RTA000027 1 2F.2.02. 1 .P.Seq 


F 


M00026860B:COd 


CH04MAL 


1319 


450196 


RTA00002691F.c.19.3.P.Seq


F 


M0004jj:9B:D10 


CH17COHLV 


1320 


26017 


RTA00002709F.d.04.1.P.Seq


F 


M0000^601D:D08 


CH02COH 


1321 


380355 


RTA00002670F.O.06.1.P.Seq


F 


MOOOjj^ . 0C:C10 


CH09LNL 


1322 


25232 


RTA00002710F.n.22.1.P.Seq


F 


M0002266 . D:B02 


CH03MAH


1323 


378952 


RTA00002683F.h.11.1.P.Seq


F 


M00040070B:B07


CH09LNL 


1324 


404487 


RTA00002687 Fx. 1 3 ,2.P.Seq 


F 


M00039943B:F10


CH14EDT 


1325 


48482 


RTA00002712F.p.06.1.P.Seq


F 


M000271^9D:F0j 


CH04MAL 


1326 


373705 


RTA00002673F.a.13.1.P.Seq


F 


M000j90^_C:F0/ 


CH09LNL 


1327 


373705 


RTA00002673F.a.13.2.P.Seq


F 


M000390r2C:F07 


CH09LNL 


1328 


21162 


RTA00002709F.c.03.1.P.Seq


F 


M0000d4-:9B:D01 


CH02COH 


1329 


15203


RTA00002710F.a.21.1.P.Seq


F 


M0O0O/9/2B:H12 


CH03MAH


1330 


21162


RTA00002709F.C.03.2.P.Seq


F 


M0000^449B:D01 


CH02COH 


1331


401013 


RTA00002685F.o.16.2.P.Seq


F 


M00039641A:A05


CH12EDT 


1332


404449 


RTA00002687F.c.04.2.P.Seq


F 


M00039770C:E04 


CH14EDT 


1333


429672 


RTA0000266SF.b. 1 0. 1 .P.Seq 


F 


M00032909A:B06 


CH08LNH 


1334


48541 


RTA00002712F.i.07.1.P.Seq


F 


M0O0269::C:B0: 


CH04MAL 


1335


378424 


RTA00002681F.a.03.2.P.Seq


F 


M00039839B:B01 


CH09LNL 


1336


49540 


RTA00002712F.d.24.1.P.Seq


F 


M00023399C:E10


CH04MAL 


1337


379170 


RTA00002672F.i.2t.I.P.Seq 


F 


M00039016D:G06 


CH09LNL 


1338


179540 


RTA00002683F.O.20.2.P.Seq


F 


M00040100C:E05 


CH09LNL 


1339 


451269


RTA0000269lF.f.l t.l. P.Seq 


F 


M 00043 4 i 13: DOS 


CH17COHLV 


1340 


449832 


RTA00002691F.e.13.2.P.Seq


F 


M00043393A:B08 


CH17COHLV


1341 


380119 


RTA00002670F.m.20.2.P.Seq 


F 


M00033560D:G07 


CH09LNL 


1342 


153094 


RTA00002714F.a.12.1.P.Seq


F 


M00027743A:C03 


CH04MAL 


1343 


448749 


RTA00002690F.d.14.2.P.Seq


F 


M00042806C:F07 


CH16COP 


1344 


448749 


RTA00002690F.d.14.3.P.Seq


F 


M00042806C:F07


CH16COP 


1345 


454816 


RTA00002693F.b.16.1.P.Seq


F 


M00043096A:G04 


CH19COP


1346 


374744 


RTA00002670F.L 1 6.2.P.Seq 


F 


M000334. D:F01 


CH09LNL 


1347 


404449 


RTA00002687F.C.04.1.P.Seq


F 


M00039770C:E04 


CH14EDT 


1348 


58005 


RTA00002661F.h.14.1.P.Seq


F 


M000042--C:EOj 


CH01COH


1349 


451379 


RTA00002691F.b.12.3.P.Seq


F 


M0004jj1-C:E08 


CH17COHLV


1350 


456323 


RTA00002694F.d.21.1.P.Seq


F 


M00043r263:D10 


CH20COHLV 


1351 


455957 


RTA00002694F.c.15.1.P.Seq


F 


M0004346r^:AO^ 


CH20COHLV 


1352 


428063 


RTA00002666F.1.05. 1 .P.Seq 


F 


M000326jSC:G08 


CH08LNH 


1353 


374722 


RTA00002676F.j.19.3.P.Seq


F 


M000j93IOA:CO/ 


CH09LNL 


1354 


428407 


RTA00002665F.p.12.1.P.Seq


F 


M000->2> I0D:F12 


CH08LNH 


1355 


378000 


RTA00002681F.j.16.1.P.Seq


F 


M00039887D:C04


CH09LNL


1356


4d2717 


RTA00UU_69_ir.D. 1 /.-.r.beq 


r 

r 






1357 


378000 


RTA00002681F.j.16.2.P.Seq


F 


M00039887D:C04


CH09LNL 


1358 


448356 


RTA00002690F.c.03.3.P.Seq 


F 


M00042760A:C12 


CH16COP


1359 


456629 


RTA00002694F.d.04.1.P.Seq


F 


M00043491C:F04 


CH20COHLV 


1360 


431346 


RTA00002669F.g.24.2.P.Seq 


F 


M000332ISA:C04 


CH08LNH


1361 


377206 


RTA00002682F.m.14.1.P.Seq


F 


M00040015C:F08


CH09LNL


1362 


453036 


RTA00002692F.b.11.2.P.Seq


F 


M00042960D:H08


CH18CON


1363


402632 


RTA00002686F.g.15.1.P.Seq


F 


M00040182D:D06 


CH13EDT
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SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1364 


230532 


RTA00002664F.C.11.2.P.Seq


F 


M00026901A:G07 


CH04MAL 


1365 


30755 


RTA00002663F.e.03.1.P.Seq


F 


M00022138A:E05



CH03MAH 


1366


451438 


RTA00002691F.d.23.3.P.Seq


F 


M00043383C:F12


CH17COHLV


1367 


379011


RTA00002681F.n.23.1.P.Seq


F 


M00039903C:D01


CH09LNL 


1368 


404048 


RTA00002687F.g.01.1.P.Seq


F 


M00040206A:A07 


CH14EDT 


1369 


404048 


RTA00002687F.g.01.2.P.Seq


F 


M00040206A:A07 


CH14EDT 


1370 


452398 


RTA00002692F.f.17.2.P.Seq


F 


M00043125C:A11 


CH18CON


1371 


403686 


RTA00002687F.d.03.1.P.Seq


F 


M00039946B:F08


CH14EDT


1372 


403686 


RTA00002687F.d.03.2.P.Seq 


F 


M00039946B:F08


CH14EDT


1373


404048 


RTA00002687F.f.24.2.P.Seq


F 


M00040206A:A07 


CH14EDT


1374 


404048 


RTA00002687F.f.24.1.P.Seq


F 


M00040206A:A07 


CH14EDT


1375 


450627 


RTA00002691F.f.01.2.P.Seq


F 


M0004340fC:G02 


CH17COHLV 


1376 


375589 


RTA00002680F.f.06.2.P.Seq 


F 


M00039794A:E04


CH09LNL 


1377 


379011


RTA00002681F.n.23.2.P.Seq


F 


M00039903C:D01 


CH09LNL 


1378


16789 


RTA00002709F.b.09.1.P.Seq


F 


M00005382B:F08 


CH02COH 


1379 


427346 


RTA00002665F.a.24.3.P.Seq


F 


M00028066C:D07 


CH08LNH 


1380 


49540 


RTA00002712F.e.01.1.P.Seq 


F 


M00023399C:E10 


CH04MAL 


1381 


14440 


RTA00002674F.e.14.2.P.Seq


F 


M00039129C:D04 


CH09LNL 


1382 


391401 


RTA00002682F.k.11.1.P.Seq


F 


M00040004D:B03 


CH09LNL 


1383 


43782 


RTA00002662F.d.21.2.P.Seq


F 


M00007165B:G11


CH02COH 


1384 


212635 


RTA00002666F.p.01.1.P.Seq


F 


M00032688D:D11


CH08LNH 


1385 


15618 


RTA00002710F.o.05.1.P.Seq


F 


M0002268-:A:C02 


CH03MAH 


1386 


18501 


RTA00002669F.g.23.3.P.Seq 


F 


M0003321"B:H07 


CH08LNH


1387 


400310 


RTA00002688F.b.05.2.P.Seq


F 


M00040375C:B06 


CH14EDT


1388 


403796 


RTA00002687F.H.17.1.P.Seq


F 


M00040293D:G04 


CH14EDT


1389 


452314 


RTA00002694F.a.21.1.P.Seq


F 


M00043416C:A02


CH20COHLV 


1390 


119179


RTA00002712F.k.20.1.P.Seq


F 


M00027021A:G02 


CH04MAL 


1391 


167451 


RTA00002663F.j.l I.I. P.Seq 


F 


M00022646A:H10 


CH03MAH 


1392 


450523 


RTA00002691F.e.19.2.P.Seq


F 


M00043401D:G03


CH17COHLV


1393 


289535 


RTA00002693F.f.06.1.P.Seq


F 


M000432023:F0I 


CH19COP 


1394 


374736 


RTA00002673F.o.08.2.P.Seq 


F 


M00039I I23:C05 


CH09LNL 


1395 


378912 


RTA00002672F.n.01.2.P.Seq


F 


M00039036C:B05


CH09LNL 


1396 


134877 


RTA00002662F.d.05.2.P.Seq


F 


M00007026B:H09 


CH02COH 


1397 


372811


RTA00002670F.C.12.2.P.Seq


F 


M0003334"C:F02 


CH09LNL 


1398 


373296 


RTA00002672F.e.08.2.P.Seq 


F 


M00038994A:A10 


CH09LNL 


1399 


373296 


RTA00002672F.e.08.1.P.Seq


F 


M00038994A:A10


CH09LNL 


1400 


452903 


RTA000026Q2 F.t". 08.2. P.Seq 


F 


M00043060D:G12 


CH18CON 


1401 


450067 


RTA00002691F.C.17.3.P.Seq


F 


M00043352D:C03 


CH17COHLV


1402 


451013 


RTA00002691F.f.08.1.P.Seq


F 


M0004340°B:B03 


CH17COHLV 


1403 


212635 


RTA00002666F.O.24.1.P.Seq


F 


M00032688D:D11


CH08LNH


1404 


452367 


RTA00002692F.c.02.2.P.Seq


F 


M00042976A:H04


CH18CON


1405 


450627 


RTA00002691F.e.24.1.P.Seq


F 


M0004340fC:G02 


CH17COHLV


1406 


186438 


RTA00002713F.i.15.1.P.Seq


F 


M00027462A:D07


CH04MAL 


1407 


431066 


RTA00002669F.c.17.3.P.Seq


F 


MO0033IS°D:FOS 


CH08LNH


1408 


378912 


RTA00002672F.m.24.2.P.Seq


F 


M00039036C:B05 


CH09LNL 


1409 


15731 


RTAO0002709F.1. 13.1. P.Seq 


F 


M00007116C:G02


CH02COH 


1410 


377187 


RTA00002683F.d.21.2.P.Seq


F 


M0004004^C:F05 


CH09LNL



SEQ
ID


CLUSTER


SEQ NAME


ORIENTATION


CLONE ID


LIBRARY


1411


J / O 1 07 


r> f \ n AAA7X771? -» AO T» D C 

K I AUUUU20 / /r.a.Uo.-i.r.beq 


F 


M00039333D:D09


CH09LNL 


1412


43U38U 


D T* \ A AAA - * AO 1 C «~ 7A ? D C 

K I AUUUU-iOV ir.c.iU.j.r.beq 


F 


M00043339C:G01


CH17COHLV


1413


J /VV4^ 


DT \AAAA'>/i70C I 71 ID Cw» 

K I AUUUUJo /Vr.l.J 1 . 1 .r.j(?q 


r- 

r 


M000j9/0/ A:D02 


CH09LNL 


1414


J / 33oy 


OT \ AAAAI/iOAC TAA 1 D 

K 1 AUUUU-OoUr.ruo. 1 .r.ieq 


F


M00039794A:E04


CH09LNL 


1415


J / 3 / oy 


K I AUUUU-0 /4r .a. l o. i , r.^eq 


F 


M00039120C:H03


CH09LNL 


1416


43022 / 


D T \ AAA A7XO,1 C r- 1 t> C in 

R I AUUUUJoy4rx. 10. 1 .r.oeq 


F 


M0004 j 4 6 ^ C ; C09 


CH20COHLV 


1417


433832 


DT* \AAAAT£Q-1C -« A7 I D C-*s- 

K I AUUUU_oy4r .a.U^. 1 .r\oeq 


r- 

r 


M00042r 92 A : H 1 0 


CH20COHLV 


1418


23 loy 


D T A A AAA7 7 1 AC rv» A^ 1 D C Jrt 

K 1 AUUlfUJ / lUKm.LD. 1 .r.oCq 


r* 

r 


M000223 /9C:C1 1 


CH03MAH


1419


J/0524 


DT* \ AAAA7£7QC U 71 7 D C 

K I AUUU020 /or .tl.iJ.i.r.beq 


F 


MOO0j94/7A:BOj 


CH09LNL 


1420


449302 


DT" \ AAAA*>AQAC k 1 "? 7 D C 

K I AUUUU-oyUr.D. 1 j.i.r.oeq 


F 


k {AAA f -\ £ \ £ PAO 

M000423 1 3C:F08 


CH16COP


1421


449362 


DT* \ AAAA7iQAC k 1 1 ** D Ca« 

KTAU0U0269yr.D. ! j.j.P.beq 


F 


& J AAA £ P* A O 

M000423 1 3C.F08 


CH16COP




iooUO 1 


DT \ AAAA^JiQAC k AO 7 D C 

K I AUOUU-oyUr.D.Uo.i.r.beq 


r* 

r 


ft 1 A A A 1 ^ J" Ilk I f A 1 

M000423 1 1 A:H04 


CH16COP


1423


286001 


R.TA000026V0r .0.08. j.P.beq 


F 


M000423 1 1 A:H04 


CH16COP


1424 


OA"* ***** 

->80j22 


RTA00002683F.p.21.1.P.Seq


F 


M00040106B:309 


CH09LNL 


1425 


40 160 j 


RTA00002683rJ.2j.2.P.5eq 


F 


M000j93 I0C:G02 


CH12EDT


1426 


376341


RTA00002678F.d. I j.2.P.Seq 


F 


M00039436A:C08


CH09LNL 


1427 


449 I2j 


RTA00002690F.a. I j. j.P.Seq 


F 


M0004243>A:AI1 


CH16COP


1428 


4I8j38 


RTA00002686F.m.07.1.P.Seq


F 


M0004026rD:30/ 


CH13EDT


1429 


380263


RTA00002689F.a.22.1.P.Seq


F 


M00042543OG04 


CH15CON


1430


435v48 


RTA00002694F.b.06.1.P.Seq


F 


M00043428D:G08 


CH20COHLV 




451679 


RTA00002693F.a.04.2.P.Seq


F 


M00042612D:F06


CH19COP


1432


j96jj2 


RTA00002686F.R. 14. l.P.Seq 


F 


M000402:2C;C06 


CH13EDT 


1433


j /7378 


RTA00002683F.d.11.2.P.Seq


F 


M00040037A:E11


CH09LNL 


1434


20061 


RTA00002710F.m.14.1.P.Seq


F 


M0002239, D:A06 


CH03MAH


1435


402494 


RTA00002686F.h.16.1.P.Seq


F 


M00040191A:B09


CH13EDT


1436


j 72798 


RTA000026 /OF.c. 1 8.2. P.Seq 


F


M000jj.*49D:F03 


CH09LNL 


1437


2 j 629^ 


RTA00002679F.a.19.2.P.Seq


F 


M000j96^^B:H09 


CH09LNL 


1438


43 1 370 


r> -T* . A AAA"! Z'Pi t P* _ /\"* ^ ft P* 

RTA00002691 F.c.Oj.j. P.Seq 


F 


M0004.- *40B:H08 


CH17COHLV


1439




RTA00002708F.n.0j. l.P.Seq 


F 


M000042j 1 ?B:F1 1 


CH01COH 


1440 


453706 


P* *T* 4 A AAA"! A i r 1 1 A 1 fi C 

RTA00002694F.D. 10. l.P.Seq 


F 


M0004j4j.>B;G09 


CH20COHLV 


1441


j46j 10 


RTA0U0u_634Ka. 1 8. 1 .P.Seq 


F 


M00040122D:A02 


CH09LNL 


1442


189361 


RTA00002676r j.Ov.j. P.Seq 


F


k JAAA^A ^ AO P\ v~ A rt 

MOO0.*9jO8B:GO8 


y™ 1 I 1 AA 1 V * 1 

CH09LNL 


1443


403200


rj t* \ AAAA1£07C I 7 1 1 O C 

RTA000U2o8 1 r .j.24. 1 P.Seq 


F


M00040." ISA: 302 


CH14EDT


1444 


,|A| | | ^ 


p> T A AAAA1£OCC I 7 D C ^« 

RTA0UUU2o83r .i.0j.2.P.Seq 


F 


k JAAA'^AJ' ^A P> p* A 

MOO0j93j0B:EO2 


PI | IIP »— \ -T* 

CH I2EDT 


1445


448680 


D T* \ ArtAAT£QAC U A7 7 D C 

R I AOOOO-oyOr.D.Ui.j. P.Seq 


F 


k 4AAA i ^ | * A P"l r AA 

M00042440B:E09 


CH16COP


1446


117060


DT,\ AAAA1X7GC k T-l 1 D C 

R I AUUOU20 fyr .n.J4. | . r.oeq 


t- 
r 


Ik 4 AAA 11 C\£. O £ j~"f\£ 

M000j9686C:C0^ 


CH09LNL 


1447


40j ZiJi) 


D 7* A AAAA7A0"*C ! 7,1 1 D Cjn 

R 1 AUUUUJOiS / r J.-4,^.r.oeq 


r 


k J AAA 1 A "* 1 O V r~) A ^ 

M00040 j 1 8 A : B0- 


CH 1 4ED I 


1448


448389


DT \ AAAA7/CQAC ■> AT 1 D C_»» 

R 1 AUUUU-oyUr.a.U j.r.beq 


r* 

r 


IM0004J j»49 D: DO 


C H 1 oCOP 


1449


i/j806 


D T \ AAAAT/17't C A7 1 D Can 

R 1 AUUOU-O /4r .0.U-. 1 .r.oeq 


r 


k »AAA"»A1 — A . A A 

MO00j91 /9A:G09 


CH09LNL


1450


J / /U33 


I\ I .-\uUUu_Oo_r .K. IJ . 1 - r . jcq 


c 

r 


V/1AAA-1 AAA < Q-^ 1 1 


r*UAO* vi 


1451 


373111


RTA00002670F.n. 14.2.P.Seq 


F 


M00033566C:E08


CH09LNL 


1452 


12350 


RTA00002713F.a.05.1.P.Seq


F 


M00027195C:E04 


CH04MAL 


1453 


450366 


RTA00002691F.C.06.3.P.Seq


F 


M00043344D:E04 


CH17COHLV


1454 


397851 


RTA00002630F.b.04.2.P.Seq 


F 


M00039775A:A09 


CH09LNL


1455 


403200 


RTA00002687F.k.01.2.P.Seq


F 


M00040318A:B02


CH14EDT 


1456 


403200 


RTA00002687F.k.01.1.P.Seq


F 


M00040318A:B02


CH14EDT 


1457 


40! 142 


RTA00002687F.i.24.2.P.Seq 


F 


M00040313C:D05


CH14EDT 
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SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1458 


375221 


RTA00002679F.k.19.1.P.Seq


F 


M00039702A:B02 


CH09LNL 


1459 


403471 


RTA00002687F.a. U. 1 .P.Seq 


F 


M0003<r-19D:D05 


CH14EDT


1460 


12270 


RTA00002711F.f.23.1.P.Seq


F 


M00023007C:E10


CH03MAH 


1461 


401013 


RTA00002685F.o.16.1.P.Seq


F 


M00039641A:A05


CH12EDT 


1462 


74344 


RTA00002661F.f.10.1.P.Seq


F 


M00003902A:C03 


CH01COH


1463 


423432 


RTA00002687F . 1. 1 0.2. P.Seq 


F 


M00040323CG1 1 


CH14EDT


1464 


423432 


RTA0OOO2687F.UO. LP.Seq 


F 


M0OO40323CCI1 


CH14EDT


1465 


379560 


RTA00002682F.g.18.1.P.Seq


F 


M00039981 A: EOS 


CH09LNL 


1466 


122669 


RTA00002712F.f.22.1.P.Seq


F 


M00026857D:G12 


CH04MAL 


1467 


373319 


RTA00002671F.c.17.2.P.Seq


F 


M0003S259B-.A02 


CH09LNL 


1468 


448034 


RTA00002690F.b.16.2.P.Seq


F 


M0004275iC:C12 


CH16COP 


1469 


376366 


RTA00002677F.h.05.2.P.Seq 


F 


M0003939'B:H09 


CH09LNL 


1470 


452253 


RTA00002692F T.04.2.P.Seq 


F 


M0004304fD:GI2 


CH18CON


1471 


401601 


RTA00002685F.f.18.2.P.Seq


F 


M00039508C:G01


CH12EDT 


1472 


373647 


RTA00002672F.d.04.1.P.Seq


F 


M00058664C:E04 


CH09LNL 


1473 


379721


RTA00002676F.b.20.2.P.Seq 


F 


M0003^276B:H09 


CH09LNL 


1474 


446404 


RTA00002689F.e.02.3.P.Seq 


F 


M00042SS7C:D07 


CH15CON 


1475 


403738 


RTA00002687F.a.10.2.P.Seq


F 


M0Q039743A:FII 


CH14EDT


1476 


376887 


RTA00002674F.r".23.2. P.Seq 


F 


M00039135D:H02


CH09LNL 


1477 


373787 


RTA00002677F.1.04.2.P.Seq 


F 


M0003^4UD:G03 


CH09LNL 


1478 


401375 


RTA00002685F.n.04.1.P.Seq


F 


M00039604B:E05 


CH12EDT 


1479 


401375 


RTA00002685F.n.04.2.P.Seq 


F 


M00039604B:E05


CH12EDT


1480 


403232 


RTA00002687F.g.20.2.P.Seq


F 


M00040218C:C02


CH14EDT


1481 


403232 


RTA00002687F.g.20.1.P.Seq


F 


M00040218C:C02


CH14EDT


1482 


449080 


RTA00002690F.a.04.2.P.Seq 


F 


M00042347D:H11


CH16COP 


1483 


430973 


RTA00002669F.a.03.4. P.Seq 


F 


M00033 r63:E12 


CH08LNH 


1484 


374742 


RTA00002676F.c.12.2.P.Seq


F 


^10003^2796:0 1 


CH09LNL 


1485 


449741


RTA00002690F.e.23.2.P.Seq 


F 


M00042S568-.H02 


CH16COP 


1486


45341 


RTA00002710F.k.19.1.P.Seq


F 


MO0O224*39A:B02 


CH03MAH 


1487 


451220 


RTA00002691F.f.07.2.P.Seq


F 


M00043408B:D11


CH17COHLV 


1488 


22067 


RTA00002708F.f.12.1.P.Seq


F 


M00004UOD:C03 


CH01COH


1489 


378952 


RTA00002683F.h.11.2.P.Seq


F 


M00040070B:B07


CH09LNL 


1490 


401435 


RTA00002635F_n. U.2.P.Seq 


F 


M0003%0"D:E0S 


CH12EDT 


1491 


375284 


RTA00002676F.g.21.2.P.Seq 


F 


M000392^8D:B04 


CH09LNL 


1492 


449080 


RTA00002690F.a.04.3.P.Seq


F 


M00042347D:H11


CH16COP 


1493 


37897 


RTA00002661F.b.15.1.P.Seq


F 


MO00OI4~63:GIO 


CH01COH


1494 


7572 


RTA00002709F.h.03.1.P.Seq


F 


M0000f>S0OB:B09 


CH02COH 


1495 


377076 


RTA00002682F.f.14.1.P.Seq


F 


M0003^3D:D12 


CH09LNL 


1496 


374828 


RTA00002674F.m.10.2.P.Seq


F 


M0003°I"0A:BI0 


CH09LNL 


1497


400295 


RTA00002685F.a.17.2.P.Seq


F 


M0003^o.-A:C09 


CH12EDT


1498 


401435 


RTA0O002685F.n.U. LP.Seq 


F 


M0003 t HiO^D:E08 


CH12EDT 


1499 


374680 


RTA00002676F.C.14.1.P.Seq


F 


M0003O:-QC;B08 


CH09LNL 


1500 


399018 


RTA00002684F.d.20.2.P.Seq 


F 


M00040123A:A09


CH09LNL 


1501 


376351 


RTA00002678F.C.19.2.P.Seq


F 


M0003 l) 45:C:G09 


CH09LNL 


1502 


19699 


RTA00002710F.f.18.1.P.Seq


F 


M00O:2IO5C:Ct2 


CH03MAH 


1503 


394113 


RTA00O02665F.d.i5.3.P.Seq 


F 


MOOO:S3UD:F05 


CH08LNH 


1504 


452652 


RTA00002692F.a.16.1.P.Seq


F 


M0004>2~C:D0I 


CH18CON



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ID 




SEQ NAME


ORIENTATION 


CLONE ID 


LIBRARY 




i £ rt. rt i 

450791 


RTA00002691F.b.23.3.P.Seq


F 


M00043338B:A03


CH17COHLV 


1506


20112


RTA00002711F.b.16.1.P.Seq


F 


M00022830D:D01


CH03MAH 


1507


4dd 142 


RTA00002694F.b.08.1.P.Seq


F 


M000434j i D:B08 


CH20COHLV 


1508


117060


RTA000026/9F.I.OL 1 .P.Seq 


F 


* ■ y% rt rt rt ✓ rt y™* j" - * /\ — 

M00Oj9686C:C0d 


CH09LNL 


1509 


4478^9 


RTA00002689r.d. b. l.P.Seq 


F 


M00042737C:H04


CH15CON


1510


452^72 


RTA00002692F.e. 1 6. 1 .P.Seq 


F 


M00043034D:C01


CH18CON


1511


4486^9


RTA00002690F.a.06.j.P.Seq 


F 


M00042348B:E05


CH16COP 


1512


378947


RTA00002683F.o.12.2.P.Seq


F 


M00040098C:B01


CH09LNL


1513


** rt ^ rt rt 

403 >99 


RTA00002687F.i.12.2.P.Seq


F 


M00040299B:F10


CH14EDT 


1514


404084


RTA00002688F.d. 16.2. P.Seq 


F 


M00040392B:H01


CH14EDT


1515


375243


RTA00002680F.d.24.1.P.Seq


F 


M00039788C:A01


CH09LNL


1516


rt, — 

22966D 


RTA00002664F.c.08.2.P.Seq 


F 


& ^ rt rt rt y rt rt * & • * /\ /\ 

M0002688^A:H09 


CH04MAL


1517


450270


RTA00002691F.a.18.3.P.Seq


F 


M00042r 18D:D04 


CH17COHLV


1518 


448841 


RTA00002690F.d.10.3.P.Seq


F 


M00042799D:F08


CH16COP 


1519


447613 


RTA00002689F.C. 1 Ll-.P.Seq 


F 


M00042693D:E01 


CH15CON


1520 


453909 


RTA00002693F.d.24.2. P.Seq 


F 


M00043I73D:G03 


CH19COP 


1521 


400213 


RTA0000268iF.a.06.2.P.Seq 


F 


M0003918^3:309 


CH12EDT 


1522 


403738 


RTA00002687F.a. 1 0. 1 .P.Seq 


F 


M00039748A:F11


CH14EDT 


1523 


456725 


RTA00002694F.e. 14. 1 .P.Seq 


F 


M00043648A:G07 


CH20COHLV 


1524 


230842 


RTA00002665F.n. 1 5. 1 .P.Seq 




M00032492A:C01 


CH08LNH


1525 


450149 


RTA00002692F.a.20.2.P.Seq 




M00042630A;C05 


CH18CON


1526 


34343 


RTA00002709F.a. 1 3. 1 .P.Seq 




M00005297D:H08 


CH02COH 


1527 


403956 


RTA00002688F.C. 12.2. P.Seq 




M000403S3A:H02 


CH14EDT 


1528 


375243 


RTA00002680F.e.01.2.P.Seq 




M00039788C:A01 


CH09LNL 


1529 


375243 


RTA00002680F.d.24.2.P.Seq 




M00039788u:A0l 


CH09LNL 


1530 


373647 


RTA00002672F.d.04.2. P.Seq 




M00038664L:E04 


CH09LNL 


1531


376897 


RTA00002674F.L20. 1 .P.Seq 




M0003916, 3:H09 


CH09LNL 


1532 


23468 


RTA00002708F.e.02.1.P.Seq


F


M0000j99 :c:F06 


CH01COH 


1533


455184 


RTA00002694F.a.05. 1 .P.Seq 




iV100042d9.*A:C02 


CH20COHLV 


1534


455327 


RTA00002694F.a.22. 1 .P.Seq 




M0004j41 ^iDO^ 


CH20COHLV


1535


455189 


RTA00002694F.C.09. 1 .P.Seq 




M0004^46tD:C02 


CH20COHLV


1536 


455688 


RTA00002694F.C. 1 8. 1 .P.Seq 




M0004j4/C?A:F07 


CH20COHLV 


1537


4^6286 


RTA00002694F.b.23. 1 .P.Seq 




M0004j4>Ol:C06 


CH20COHLV


1538


455883 


RTA00002694F.a.23. l.P.Seq 




M0004j418A:H10 


CH20COHLV


1539


456308


RTA00002694F.d.22. 1 .P.Seq 




M00043527C:E09


CH20COHLV 


1540


* 4^2720 


RTA00002694F.d. 14. l.P.Seq 




& * rt rt rt • ^ ■ — v t ■ rt rt 

M0004_ol63:H09 


CH20COHLV


1541


455319 


RTA00002694F.b. 1 3. l.P.Seq 




M0004^4j , D:D04 


CH20COHLV 


1542 


4;n8b 


RTA00002694F.C.24. 1 .P.Seq 




& a rt. rt rt t ^ i i> ^ ■ 

M0004^48.'3.G 10 


CH20COHLV 


1543


451814 


RTA00002692F.e.20.2. P.Seq 




M0004j0403:307 


CH18CON


1544


4436^9 


RTA00002690F b.07.3. P.Seq 




iVIU0U4_.4 / UL . bU^ 


CH16COP


1545 


450578 


RTA0000269I F.b.20.3. P.Seq 




M00043332C:G04 


CH17COHLV 


1546 


451193


RTA00002691F.b.0 1.3. P.Seq 




M00043304C.D02 


CH17COHLV 


1547 


451981 


RTA00002692F.C. 23.2. P.Seq 




M00043001D:D03 


CH18CON


1548 


447359 


RTA00002689F.d.13.2.P.Seq


F


M00042737C:H04 


CH15CON


1549 


44^415 


RTA00002690F.a.23.5.P.Seq


F


M00042439B:D03 


CH16COP


1550 


451193


RTA00002691F.a.24.2 P.Seq 




M00043304C:D02 


CH17COHLV


1551 


452032 


RTA00002692F.e.04.2. P.Seq 




M000430:6C:DO7 • 


CH18CON



1552


452028 


RTA00002692Fb.03.2. P.Seq 


F 


M00042955CD05 


CH18CON


1553 


453981 


RTA0O002693F.C.07.2. P.Seq 


F 


M00043131B.A09 


CH19COP


1554 


422703 


RTA00002687F.h.04.1.P.Seq 


F 


M00040219D:E08 


CH14EDT


1555 


450966 


RTA0000269IF.e.05.2.P.Seq 


F 


M00043386A:B08 


CH17COHLV


1556 


451058 


RTA0000269lF.b-24.3.P.Seq 


F 


M00043338B:Cli 


CH17COHLV


1557 


451058 


RTA0000269)F.c.01.3-P.Seq 


F 


M00043338B:Clt 


CH17COHLV


1558 


414331 


RTA00002685F.ci42.PSeq 


F 


M00039376D-.H07 


CH12EDT 


1559 


453398 


RTA00002693F.a.l4.2.P.Seq 


F 


M00043073A:C12 


CH19COP


1560 


454962 


RTAO0002693F.e. l0.2.P.Seq 


F 


M00043184A:H08 


CH19COP


1561


451193


RTA0000269 1 F.a.24.3. P.Seq 


F 


M00043304C:D02 


CH17COHLV


1562 


447859 


RTA000026S9F.d. 13.3. P.Seq 


F 


M00042737C.H04 


CH15CON 


1563 


454335 


RTA00002693F.e.l4.2.P.Seq 


F 


M000431S7A:C04 


CH19COP


1564 


456522 


RTA00002694F.e. 1 6. 1 .P.Seq 


F 


M00043649B:£07 


CH20COHLV 


1565 


379167 


RTA00002692F.c.09.2.P.Seq 


F 


M00042983C:AU 


CH18CON


1566 


402763 


RTA00002686F.C. 1 1.1. P.Seq 


F 


M00040136CF08 


CH13EDT 


1567 


450956 


RTA0000269 1 F.b.04.3.P.Seq 


F 


M00043306C:B03 


CH17COHLV


1568 


448999 


RTA00002690F.d.08.3.P.Seq 


F 


M00042792A.HOI 


CH16COP 


1569 


448479 


RTA0000269 1 F.a.20.3. P.Seq 


F 


M00043296B.G09 


CH17COHLV


1570 


414331 


RTA00002685 F.c. 14. 1 .P.Seq 


F 


M00039376D:H07 


CH12EDT


1571 


390 1 ^4 


RTA00002693F.d.l3.2.P.Seq 


F 


M00043!62A:B08 


CH19COP


1572


174^50 


RTA00002689F.C.09.1.P Sea 


F 


M00042697D:C07 


CH15CON


1573 


44S99 I 


RTA00002690F.d.02.3.P.Seq 


F 


M00042784A.H06 


CH16COP 


1574 


453909 


RTA00002693F.e.0L2.P.Seq 


F 



M00043173D:G03 


CH19COP


1575 


448933 


RTA00002693 F.c.05. 1 .P.Seq 


F 


M00043109CG0I 


CH19COP


1576 


44903 ~> 


RTA00002690F.a. 1 5. 3. P.Seq 


F 


M00042436B.H09 
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M00039180A.A07 


CH09LNL


1739 


377557 


RTA000026/ 1 F t. 19 j.P.Seq 


c 
r 


l | AAA"* O ^ /I r*V. PA'! 




1740 


402056 


RTAOOO0* , 686F.1. 12.1. P.Seq 


F 


M00040260B.D02 


CH13EDT


1741 


402424 


RTA00002686F.k. 15.1. P.Seq 


F 


M0004Q253CA05 


CH13EDT 


1742 


379575 


RTA00002679F.p.23. 1. P.Seq 


F 


M0003977IC:E1 1 


CH09LNL


1743 


22651 


RTA000027O8FT. 13.1. P.Seq 


F 


M00004I44A:H05 


CH01COH


1744 


376151 


RT A000026 75 F . n. 04 . L P Seq 


F 


M00039255CE12 


CH09LNL


1745


377183 


RTA00002683F.e. 15.1. P.Seq 


F 


M00040052D:F12 


CH09LNL


1746 


379094 


RTA000026::F.o.07.2.P.Seq 


F 


M00059045B:E01 


CH09LNL


1747 


38891 


RTA0000267;F.h.24.2.P.Seq 


F 


M00t)394O]B:D02 


CH09LNL


1748


J f O JUO 


RTAftflOO"630F I M 7 P Sea 


F 


iVIOOO^S^OA HI 1 


CH09LNL 


1749


J /07J J 




F 


M000400-OD' A06 


CH09LNL 


1750




RTAonno" t 66"F f 17 1 PSea 


F 


M000^64B 010 


CH03MAH 


1751


"iff (07^ 


RTAOno0' ? 6"0F k "*0 1 PSea 


F 


M000334^4A D09 


CH09LNL 


1752




RTA0n00"''669F f 10 3.P Sea 


F 


M00033* > 05AFO3 


CH08LNH 




dd(\ 1 10 


RTA0000"639F b 13 l .P.Sea 


F 


M00042565C:A08 


CH15CON 






RTAOO00^633F c 1 1 ^ PSea 


F 


M00040040A A06 


CH09LNL 


1755


J 1 700 


RTAnnno"*" ! 3F i 16 1 PSea 


F 


1^000^746^ BH07 


CH04MAL 


1756




RTAOnOfl^'ORF h 04 1 PSea 


F 


M00004' 7 41BH07 


CH01COH 


1757


7*r 1 UO 


RTA0000 n ~ I "'F i 01 I PSea 


F 


MO0O°6946A FI2 


CH04MAL 


1758


J / OoVO 


RTAnnnrrfiTTF i 03 "> P Sea 


F 


1^0003940^ BE03 


CH09LNL 


1759


17r»4AQ 
j / 0*+0" 


RTAonno^ A"4F h 06 \ P Sea 


F 


M00039140D- A04 


CH09LNL 


1760


HJJ I ~* I 


ota nnnfP ft04F r 06 I P Sea 


F 

■ 


M00043458ABP 


CH20COHLV 


1761


J 1 J Jo \ 


R T a HfiflfP 3 1 F a0 7 1 P Sea 


F 

1 


M00040031 A E06 


CH09LNL 


1762


I OU I VO 


RTAfinnn^AAiF f 1 1 IP Sea 


F 


M000" , ^" ) 34CD06 


CH03MAH 


1763


IojV4j 


0TAnnnn7~ r *; f h I 1 PSea 


F 


MOOO^iTryRos 


CH04MAL 


1764


AAA. 1 
440 1 jy 


DTAnnnnTASOF h n i P ^t?a 


F 
* 


M0004" , ^65C A08 


CH15CON

lit** ^» r ■ » 


1765


17Q 1 97 


dt a nnrtm a 97 P r I ■> 1 P *\*»n 

K 1 AUUUU— Oo— r . c. i J. I . r . jCLj 


F 


M000"99 -, 8BG05 


CH09LNL 


1766


j/oiuU 


K 1 AUUUVJ _0 v j r .i,uo — ^.oc^ 


F 

r 


MOOtUi^Ol A R09 


CH19COP 

^« 111' V— v_/ * 


1767


ic\ z.c\c 

j /yjvo 


K I AUUUU_0«5 1 r.C. 1 U.— .r.OC^ 


F 

r 


M000i9Si IT D17 


CH09LNL

^— • IV/ - L^i 1 *-. 


1768


^ C*7 1 C 
JJ f I J 




F 

r 


M00001 o"6A H I 1 

1 V | vVvV 1 J _ \J i \ .1111 


CH01COH


1769


42ojUU 


K 1 AUUUU- OO? r .p.UO. i - r. 


F 


M0005'' 50RB- H03 


CH08LNH 


1770


.nog t 


ota nrif\n7/iA7F ■« if) i p 9*»n 
K 1 AUUUU — OO / r .□. l v. 1 .r . jcL| 


F 

r 


M000j" > 7PBG0" ) 


CH08LNH 


1771


j /o9I 1 


K 1 AUUUU^.0 -r.n — — r.oci^ 


F 

r 


M000^904" > R- RO' 7 


CH09LNL 


1772


j7joV7 


K 1 AUUUU— o • oT.a.u i .—.r.oc^ 


F 
r 


M0OfP94S4FV A 1 1 


CH09LNL 


1773


-> /-iooo 


DTArtrkn^">A~nF s "»7 7 p <\,»n 


F 

r 


M000™4 1CH08 


CH09LNL 


1774


1790 1 1 


oTAnfinn"*^ — >f n o i ^ P Se»a 

K I AUUUU— O ■ — r -U.u i — r .jc^ 


F 


MO003904^BBO -) 


CH09LNL 


1775


1 T7 ,1 < I 


OT A flflfiA7 A Al P *» 17 1 P ^r»n 
K i AUUUU— OO J r .a. I — . 1 ,r . jci^ 


F 

r 


MOOOOSO^ftB C 1 1 


CH03MAH 


1776




OTArtfinA"'"* 1 1 P r 11 1 P *\t»n 
K 1 AUUUU— . I I r .C. 1 l.r .ocLj 


F 


M000^8^6C A 07 


CH03MAH 

Xt t ft \* *s » * • J »» ^ 


1777


J / J 11 


K 1 AUUUU- • 'JOr.i — U. i.r.oci^ 


F 


M000041 ^5D- A 10 


CH01COH


1778


,11 1 .1 1 Q 


DTAnnfin^^nOF i ^l 1 P Sea 

rv I AUUUU — OQ Vr J.— J.J.r .ocu| 


F 
i 


M0003 3^6 1 C" DP 


CH08LNH 

ft ft w w ► • * 


1779


1 Q£. "1A(\ 

1 oO JOU 




F 

* 


MOOO^O'B F07 


CH04MAL 


1780


hjU / j 1 


BTAHfWPririQF i 1 1 "> P Sea 

|\ 1 rtUUUU-007r .J.I I 1 .JCij 


F 


M00033" , 48A BO"* 


CH08LNH 


1781


177 -n ~77 

J /_J / 2 


PT A nn.fif)7 a ~oF o °0 1 P Sea 
KIAUUUU—O ur.s.-U.i.r.oc^ 


F 


M0003*410BC09 


CH09LNL 


1782


J /OV 1 j 


DT AnflftfPr>9." F m 04 7 p *s^a 


F 


M00040089C E06 


CH09LNL 

X^h* V ft X^ ' * ^i— 




l7AQQfi 

J /0 77U 


RTAnonfl" , 6J?'iF f 09 "> P Sea 


F 


M0004005 ;; DB01 


CH09LNL 


1784


■ ■ i — , . 

COCAO 
JOJ\JO 


RT Anonn^Afi 1 F e 17 1 PSea 


F 


M00003786A A 1 1 


CH01COH


1785


1 90 1 lO 


PT Annnn'*Afi4F h 1 4 "* P Sea 

|\ | /\UUUU— OOti.U. I -r. — .r . 


F 


M00XP6SMB F01 


CH04MAL 


1786


1 9 i A7 < 


PTAnOOn^fi^OF k ''O "* P Sea 


F 


1^1000334^4 A DO 9 


CH09LNL 


1787


HQ 1 7A 

J 17 \ -0 


PTAnnnn^^^lF nfi^P Sea 


F 


M0004009" , BF05 


CH09LNL 


1788 


377633 


RTA00002684F.g. 1 5.2.P.Seq 


F 


M00040304B:F06 


CH09LNL 


1789 


430284 


RTA00002667F.k.06.l.P.Seq 


F 


M00032831C:G07 


CH08LNH


1790 


374773 


RTA0000:6"6F.I.22.5.P.Seq 


F 


M00039316A:C01


CH09LNL 


1791 


403761 


RTA00002687F.m.03.I.P.Seq 


F 


M00040327B:G06


CH14EDT 


1792 


375547 


RTA000026""F.m.04.2.P.Seq 


F 


M00039417A:E12 


CH09LNL 


1793 


80436 


RTA0000266 1 F.c.09. 1 .P.Seq 


F 


M00001582A:E02 


CH01COH


1794 


189139 


RTA00002664FbU4.LP.Seq 


F 


M00026351B:F0I 


CH04MAL 


1795 


376614 


RTA000026 T 7F.c.05.2.P.Seq 


F 


M00039341D.D07 


CH09LNL 


1796 


404513 


RTA000026SSF.d. 1 3. 1 .P.Seq 


F 


M00040390B:F02 


CH14EDT


1797


375714 


RTA000O2677F.m. 1 3.2.P.Seq 


F 


M00039417C:G01


CH09LNL 


1798 


51564 


RTA000027 !2F.d.23. 1 .P.Seq 


F 


M00023398B:D12 


CH04MAL 


1799 


399551 


RTA00002687FT. l3.2.P.Seq 


F 


M00040203D:H11


CH14EDT


1800 


133512 


RTA00002 693F.e-24.2-P.Seq 


F 


M00043200A.H09 


CH19COP


1801 


375176 


RTA00002675F.p. 13.1. P.Seq 


F 


M00039266D:H04 


CH09LNL 


1802 


375704 


RTA00002676F.h. l3.2.P.Seq 


F 


M00039300CG04 


CH09LNL 


1803 


399551 


RTA00002687FT. 15.1. P.Seq 


F 


M00040203D:H11


CH14EDT


1804 


403357 


RTA00002687F.i-05.2.P-Seq 


F 


M00040298B.G02 


CH14EDT 


1805 


34513


RTA00002709F.C.22. 1 .P.Seq 


F 


M00005555A:AI0 


CH02COH 


1806


121871 


RTA000027 1 3F.3.09. 1 .P.Seq 


F 


M00027I98B:B08 


CH04MAL


1807 


32095 


RTA00002662F.d. 1 5.2. P.Seq 


F 


M00007I I2CB10 


CH02COH 


1808 


403183


RTA00002687F.n.02. 1 .P.Seq 


F 


M00040332D:B05 


CH14EDT


1809 


168691 


RTA00002663F.J.02.1. P.Seq 


F 


M00022615D.G05 


CH03MAH 


1810 


430854 


RTA00002668F.p.2 1 .2.P.Seq 


F 


M00033I73D:COI 


CH08LNH 


1811 


377987 


RTA00002679F.h.08. 1 .P.Seq 


F 


M00039682A:C08 


CH09LNL 


1812 


428408 


RTA00002665F.p.23.l .P.Seq 


F 


M00032513D:F01 


CH08LNH 


1813 


375930 


RTA00002677F.h.03.2.P.Seq 


F 


M00039396D:B04 


CH09LNL 


1814 


28453 


RTA000027llF.h.07.1.P.Seq 


F 


M00023094A;B11 


CH03MAH 


1815 


119478


RTA00002686F.n.07. 1 .P.Seq 


F 


M00040271C:D08 


CH13EDT 


1816 


403189


RTA00002687F.g. 1 6.2-P.Seq 


F 


M000402I7D.B07 


CH14EDT 


1817 


129692 


RT A00002679 F .e. 1 3 . 1 . P . Seq 


F 


M00039673A;F09 


CH09LNL 


1818 


86668 


RTA00002664F.a. 1 0.2.P.Seq 


F 


M00023352B:F03 


CH04MAL


1819 


403357 


RTA000026S7F.L05. 1 .P.Seq 


F 


M00040298B:G02 


CH14EDT 


1820 


373198 


RTA00002670F.p.0 1 .2.P.Seq 


F 


M00033578D:G02 


CH09LNL 


1821 


373198 


RTA00002670F.O.24.2. P.Seq 


F 


M00053578D:G02 


CH09LNL 


1822 


25233 


RTA000027 1 1 F.b.06. 1 .P.Seq 


F 


M00022823C:C01


CH03MAH 


1823 


403429 


RTA00002687F.a.07.2. P.Seq 


F 


M00039746D:Dll 


CH14EDT


1824 


417119


RTA00002636F.L 14.1. P.Seq 


F 


M00040222D:G02 


CH15EDT 


1825 


376066 


RTA00002680F.C. 1 2.2.P.Seq 


F 


M00059781D:D10 


CH09LNL 


1826 


403189 


RTA00002687F.2. 1 6. 1 .P.Seq 


F 


t V1000^02l7D:B07 


CH14EDT


1827 


403429 


RT A00002687 F . a.07 . 1 . P . Seq 


F 


M00059746D:DII 


CH14EDT


1828 


430975 


RTA00002669Fj.06.3.P.Seq 


F 


M00033246C:E08 


CH08LNH 


1829 


427544 


RTA00002665F.e.03. 1 -P.Seq 


F 


M00023354A:B12 


CH08LNH 


1830 


401155 


RTA00002685F.0. 1 2. 1 .P.Seq 


F 


M00039630A:C08 


CH12EDT


1831 


377005 


RTA 00002682 F. k. 1 5. 1 .P.Seq 


F 


M00040005D:B07 


CH09LNL 


1832 


379032 


RTA00002683 F.a.07. 1 .P.Seq 


F 


M00040032A:D09 


CH09LNL 


1833 


400097 


RTA00002685F.g. I9.2.P.Seq 


F 


M00039521A:A02 


CH12EDT


1834 


383401 


RTA00002670F.k. 1 3.2.P.Seq 


F 


M00035450C:A02 


CH09LNL 


1835 


379052 


RTA00002683F.a.07.2.P.Seq 


F 


M00040032A:D09 


CH09LNL 


1836 


429663 


RTA00002667F.m.2 1 . 1 .P-Seq 


F 


M00032864B:B09 


CH08LNH 


1837 


374018 


RTA00002672F.a.l4.2.P.Seq 


F 


M00053632C:B09 


CH09LNL 


1838 


375409 


RTA00002678F.n.02.2.P.Seq 


F 


M00039616B:C0I 


CH09LNL 


1839 


401155


RTA000026S5F.O. l2.2.P.Seq 


F 


M00039630A:C08 


CH12EDT 


1840 


13958 


RTA000027 1 1 F.b-02- 1 P.Seq 


F 


M00022817A:H02 


CH03MAH 


1841 


38767 


RTA00002687F.a.l I.I. P.Seq 


F 


M00039748C:FI1 


CH14EDT


1842 


29398 


RTA00002665F.C.23.I. P.Seq 


F 


M00022015B:B07 


CH03MAH 


1843 


12453 


RTA00002709F.C.23.2. P.Seq 


F 


M00005556B:D02 


CH02COH 


1844 


38767 


RTA00002687F a. ! 1 . 2. P. Seq 


F 


M00039743C:FI I 


CH14EDT


1845


279835 


RTA00002671F f.05.2. P.Seq 


F 


M00038279C:AM 


CH09LNL


1846


188592 


RTA00002664F.e. 1 8.2.P.Seq 


F 


M00027141CH03 


CH04MAL 


1847 


376469 


RTA00002674F.h.06.2.P.Seq 


F


M00039!40D:A04 


CH09LNL


SEQ
ID


CLUSTER 


SEQ NAME


ORIENTATION


CLONE ID


LIBRARY 


1 


10600 


RTA00002S9!F.j.07. LP.Seq 


F 


M0OOO3753B:DO7 


CH01COH 


2 


18327 


RT.AO0OO290OF. d. 12. LP. Sea 


F 


M000054I3D:A05 


CH02COH 


3


1759 


RTA000029:3F.t'.23. LP.Seq 


F 


M00039243CA03 


CH09LNL 


4 


10924 


RTA00002907F.k.l2.LP.Seq 


F 


M00022224A.C07 


CH03MAH 


5 


45331 


RTA00002905F.1. 10. LP.Seq 


F 


M00007037D:D10 


CH02COH 


6 


42233 


RTA00002912F.2.24. LP.Seq 


F 


M00027359B:A06 


CH04MAL 


7 


7211 


RTA00002909F.h.06. LP.Seq 


F 


MO0022634A:CO7 


CH03MAH 


8


21395 


RTA00002S90F.k. 16. LP.Seq 


F 


M00001637D:C12 


CH01COH 


9 


3093 


RTA00002923F.e.O3. 1 .P.Seq 


F 


M00039225A:D11 


CH09LNL 


10 


15806 


RTA00002S94F.L07. LP.Seq 


F 


M00003991A:C11 


CH01COH


11 


19739 


RTA00002S96F.d. 12. 1. P.Seq 


F 


M00004147CE01 


CH01COH 


12


140879 


RTA00002905F.C. 1 7. 1 .P.Sea 


F 


M000079S5CD0S 


CH03MAH 


13 


29706 


RTA0000290SF.1.22. LP.Seq 


F 


M000224S7B:A0S 


CH03MAH 


14 


109581 


RTA000029 1 SF.i.OS. 1 .P.Seq 


F 


M0003290SA:DOS 


CH08LNH


15 


25009 


RTA00002906F.k. 1 1. t.P.Seq 


F 


M00022016B:F01 


CH03MAH 


16 


8328 


RTA00002SSSF.e.07. LP.Seq 


F 


M0000l45lC:Ei0 


CH01COH


17 


15045 


RTA0O0028S7F.e.O6. LP.Seq 


F 


MOOOOI393C:E0S 


CH01COH


18


21216 


RT A0000239SF.p.22. 1 .P.Seq 


F 


M000044l6B:Gi0 


CH01COH


19 


185754 


RTA000029 12F.1.09. 1 .P.Sea 


F 


MOO0275O6B:G01 


CH04MAL 


20 


11831 


RTAO00O2909F.h. 10. LP.Seq 


F 


M00022fc>3SA:D03 


CH03MAH 


21 


1859S9 


RTA000029 lOF.h. 1 2 .LP.Sea 


F 


M00022924C:F0-t 


CH03MAH 


22 


9667 


RTA00002923F.a.03. LP.Seq 


F 


M00039162D:C04 


CH09LNL 


23 


15S17 


RTA00002903F.O.03. LP.Seq 


F 


M00007103D:C02 


CH02COH 


24 


10198 


RTA00002923F. i.09. LP.Seq 


F 


M00039294C:B09 


CH09LNL 


25 


6355 


RTA00002S94F.p.I2. LP.Seq 


F 


M00004055D:D05 


CH01COH


26 


12227 


RTA00002909F.e. IS. LP.Seq 


F 


M00022601B:G06 


CH03MAH 


27


11047 


RTA00002S93F.O.06. LP.Seq 


F 


M00003960D:C12 


CH01COH


28 


1870 


RTA000029lOF.m.OS. LP.Seq 


F 


M00023020CH03 


CH03MAH 


29


20C65 


RTAG0C0290SF.rn.09. LP.Seq 


F 


M00022-°l \:AOS 


CH03MAH 


30 


19454 


RTA000029C0F.m.23. LP.Seq 


F 


M000053~9A:D10 


CH02COH 


31 


48048 


RTA00002922F.m. 1 3. 1 .P.Seq 


F 


M00039124D:H01 


CH09LNL 


32 


19799 


RTA0000290SF.h. 19. LP.Seq 


F 


M00022-i49D:FOS 


CH03MAH 




135562 


RTA000029 i 1 F.m.07. 1 .P.Seq 


F 


M00027093A:H0: 


CH04MAL 


34 


24214 


RTA00002S9 1 F.k. 19. LP.Seq 


F 


M00003~64D:FO~ 


CH01COH


35 


5172 


RTAOO0O2903F.p.22. LP.Seq 


F 


M00022525B:D09 


CH03MAH 


36 


50495 


RTA00002S9SF.C. 16. LP.Seq 


F 


M0000432lC:Cli 


CH01COH


37


43287 


RT AO0O0290S F.k. 1 6. 1 .P.Seq 


F 


M000224~OD:BO: 


CH03MAH 


38


15324 


RTA00002905F.p.20. LP.Seq 


F 


M00021<?9~C:BO~ 


CH03MAH 


39 


22157 


RTA000023SSF. 2.07. LP.Seq 


F 


M00001461D:B10 


CH01COH


40 


15249 


RTA00C029 1 5F.1.0S. 1 .P.Seq 


F 


M000324S9B:Gi: 


CH08LNH


41 


2764 


RTA00002925F.0.1 1. LP.Seq 


F 


M00039c>:9B:E0; 


CH09LNL 


42 


23838 


RT A00002SS9F.b. 1 4. 1 .P.Seq 


F 


MC0001?iSB:DiO 


CH01COH


43 


11074 


RTA0OOO2S99F.g. 22. LP.Seq 


F 


M0000-;c03C:C10 


CH01COH


44 


18367 


RTA000029:2F.b.09. LP.Seq 


F 


M0003Sc;9D:Ci2 


CH09LNL 




21703 


RTA0000290:F.m OS. LP.Seq 


F 


M0OOO"O59B:D0" 


CH02COH


46 


21470 


RTA0O0O:S9f F.c. 14. LP.Seq 


F 


M00004iV7B:D03 


CH01COH


47 


15492 


R7A0000290~F.p.0b. LP.Seq 


F 


mooo:::s:b:C09 


CH03MAH 


48


4022 


RTA00002S9>".i.22. LP.Seq 


F 


M000042o9B:B0- 


CH01COH


49 


21579 


RT A00002S9 i F.e.03. 1 P.Seq 


F 


M000OlcSt>B:H0: 


CH01COH


50 


IS6233 


RTA00G0Z9! 3 F.c.Ob. LP.Seq 


F 


mooo:*s.;.:b:DO" 


CH04MAL


SEQ
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


51 

*v A 


5410 

•V™ %. V/ 


RTA00002921F.L0S t P Seq 

m X » ■ ■ ^V Xr XV XV *ft> ftw I * t w ► ^V »V p ft. * ft ■ ^V X# %4 


F 


M00033445DG03 


CH09LNL


52 

to/ 


2">4?0 


RTA0000290IF.e. 19 I P Seq 

A X ft. ■ • XV — * XV X^ ftft -r X^ * Br » »* * ft V * ft ■ ft. r ^V Xft 


F 


M00005474C H09 


CH02COH


53 

J to/ 


140553 


RTA000029 16F n CP 1 P Sea 

& X * * * W xV \f V ft* V \ W ft i I ■ * XV •* * & r A * *J Xt %4 


F 


M0003763SB F0° 


CH08LNH


54 


^3849 

*— > w* W ^» V 7 


RTA00002SS7F.X22 I P Sea 

ft X ft » ft XV XV Xrf XV to» X^ XV * A f vw * *v *v < 4 ■ ^ * fc- 


F 


M000013S6B:FI 1 


CH01COH


55 


21945 

4ft & T *J 


RTA0000' ? 895F.1.' 1 2 l.P Seq 


F 


M00004103CEIO 


CH01COH


56 

^V V 


7867 


RTA0000290IF p.08.1 P Seq 


F 


M000057lOB:H03 

ft ' ft X^ tor * ft M -_- w * ft ft 


CH02COH 


57 


14533 


RTA0000' ? 896F.1.0 1 . 1 .P.Seq 


F 


M00004179C.B06 


CH01COH


58 

*v \j 


5790 

w ' X w 


RTA00002919F.2. 17.1. P.Seq 


F 


M00033080C:A07 


CH08LNH


59 


186153 

1 W W I mS 


RTA0000' 7 9 1 IF iP4 I P Seq 


F 


M000270t7A:B09 

ft v ft TftT ^ftT 4ft * ft * ft ft » 


CH04MAL 


60 


10561 


RTA000CP899F h.08 1 P.Seq 


F 


M00004606D:H09 

ft T ft ■ VJf J toj^ » 4 ft> ^ 


CH01COH


61 

X/ 4 


2457^ 


RTA0000 o 893F.1.08 l.P Seq 


F 


M00003926A:F1 1 

ft ft. X^ »f ftw • ft *- * a ft 


CH01COH


62 


13138 


RTAOOOO^SSSF m 03 1 P Sea 


F 


M00001488C: A03 

ft T ft \f W ft " «^ X«/ • * ft X^ to/ 


CH01COH


63 


6701 

w f U I 


RTAO00O°9°' J F cr 18 1 p Sea 


F 


M00039055CA01 

ft ▼ tor ^ tor ^tor^ * * ft ft 


CH09LNL 


64 


P75 1 


RTA0000°904F c 10 I P Sea 


F 


M0000720^BF01 


CH02COH 


65 


3583 


RTAOO0O' 7 9 16F n 0 1 I P Sea 


F 


M0003 9 644CB05 


CH08LNH


66 


P673 


RTA0000790 1 F d 0 4 I P Sea 


F 

4 


M00005463AG02 


CH02COH 

- ft ft ftfti x*_» ft A 


67 




RTAO000°901F I 0 1 I P Sea 


F 

4 


M000056 7 3BG01 

4 T 4 Vw Vw W W to/ 4a^ » W \p/ 4 


CH02COH


68


9 lO^ 


RTAOOOO^^^F k °4 1 P Sea 


F 
i 


M000391 1 1 A-CP 

!▼ A w\-/ w^-/ ✓ A 1 i * V » x^ A ft_ 


CH09LNL 

\w- A 1 w / Av«t A ~ Aw 




JUJ7U 


RTA0000^9 19F a ^4 1 P Sen 


F 


M00033081DD1 1 


CH08LNH


70 




RTA0000°S90F c 14 1 P Sea 


F 


M00001596ADO' 7 

4T1 V - f\_/* w # W ft. to/ -/ 4 ft * X/fttf 


CH01COH


71




RTAOOOO^QOOF 0 14 1 P Sea 


F 


M00005401D* P 09 


CH02COH




1 000 


RTA0000^9 1 SF * ">0 1 P Sea 


F 


M0003°S97C 'C P 


CH08LNH


73 


447^7 

*T*+ I J 1 


RTA0000^901F a^O 1 P Sea 


F 


M00005434 AC03 

A~ A Ww \J^f~ to/ ~ * * - Xto« w a/ 


CH02COH


74


41 


RT 40000^9 1SF n 0 3 I P Sea 


F 

A 


M0003' ? 98SB-G01 

A" A w wX/tor « ^ ww A-^ - X-^ w 4 


CH08LNH


75 


4188^ 


RTA0000 ? 90 7 F d P I P Sea 

XX f 4 A WW ft* J W ,ftA-ft.-».4.A - >Jw vi 


F 

4 


M000065S6DD04 


CH02COH 


76 


500 


RT A0000' ) 9' ? 5F o 18 I P Sea 


F 

4 


M00040034AE06 

4*4 WV • VV*r ~ 4 4 * JUft X/ 


CH09LNL 

\r » ft X^ tos* ft * ft«* 


77 


JHJJ 


RTAOOOO'^O'' IF f ">0 1 P Sea 


F 

4 


M000334 ? OB E08 

^ ▼ |\/\/v/ to/ to/ » toa W ft_^ • ft^ \J 


CH09LNL 

ft a> XV ft * ft ' ftii ft 


78 


1 JO — 7 


RTA0000^900F i 01 1 P Sea 


F 

4 


M00005314A:GIO 

4 r * X* \^ W Vv — ^ «^ ft » Aft* V»* * 


CH02COH 


79 


1 "S408"i 


RTAO000" 1 9O"F a 06 1 P Sea 


F 

A 


M000-^096D: A03 

4 T ft X* tor ft»- J ■ _J » » ft \^ to/ 


CH03MAH 


80 


7418 1 


RTA00007910F i 16 1 P Sea 


F 

A 


M000' ? ' ) 953BD06 

ft ~ iVV/V/to ✓ to/ to/ » *^ vv 


CH03MAH 


81 


107940 


RTAO0OO°930F r*07 1 P Sea 


F 

4 


M00055735AH08 

4*4 VW*/ to/ f •/ / V * 4 AW W 


CH15CON




94761 


RTAOOOO^OO^F I 71 ] P Sea 


F 

A 


M00006756C* AO 7 

4 ~ 1 WVW ' to/ X/^ta^ - » * X/ 


CH02COH 


83


10714 


RTA0000" > 9^4F e 07 1 P Sea 

IX 1 riV/VVV™ 7 .t.V-. 1 >i .JUU 


F 

A 


M00039457DCO' 7 

4*1 WX/ •/ » F 4»^ ♦ X»y W to* 


CH09LNL 


84 


40140 


RTA00007897F d^j 1 P Sea 


F 

A 


M00004303CC05 


CH01COH


85 




RTA0000°930F i 07 1 P Sea 


F 

A 


M00056057CF06 

ft * A. W ^^^^ to/^ W f * *• V 


CH15CON


86 


7896 


RTA0O0O°9O6F i 08 1 P Sea 


F 

4 


M0OCP1998BDO9 


CH03MAH 


87


i'rJO / 


RTA0OO0°S96F e 09 1 P Sea 


F 

A 


M00004151B- A07 


CH01COH


88 


94^0 


RTA00007SS9F h 0° 1 P Sea 


F 

A 


M00001546BC 1 1 


CH01COH


89 

07 


1041 1 


RTA00007SS7F d 07 1 P Sea 


F 

A, 


M000014"'9B*G05 

4 * ft ^/^^^^ ft ' to* ^ * • \^ to/ 


CH01COH


90


1 4UDJ 


RTA0000°90SF a 07 1 P Sea 


F 


M 000° 0 4° 5 A • C 09 

ft * L to* to* t ft* to/ « ft * X^ \-/ 


CH03MAH


91 


10302 


RTA00002906F.o.03.LP.Seq 


F 


M000220S1A:B07 


CH03MAH


92 


28436 


RT A000029OS F. f.O S . 1 . P . S eq 


F 


M00022415C:D12 


CH03MAH 


93 


17829 


RTA00002SS9F.g.l 1. 1. P.Seq 


F 


M00001544B-.E06 


CH01COH


94 


10390 


RTA00002906F.e. 13. LP. Seq 


F 


M00021923A:B12 


CH03MAH


95 


11619 


RTA000029 1 3F.C.07. 1 .P.Seq 


F 


M00027S06C:HO5 


CH04MAL 


96 


6890 


RTA000029lSF.m. 19.1. P.Seq 


F 


M00032979D:H07 


CH08LNH


97 


10110 


RTA00002S97F.0. 1 3. 1 P.Seq 


F 


M00004245C:G10 


CH01COH


98 


21511 


RTA00002S92F.h.24.:.P.Seq 


F 


M00003S21C:E12 


CH01COH


99 


92S7 


RT A00002 S99F.h. 14. IP. Seq 


F 


M0000460SA:C10 


CH01COH


100 


16575 


RTA00002S95F.n.02.l.P Seq 


F 


M000041I0D:F0v) 


CH01COH



SEQ
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


101 


16857 


RT A00002 S99F. h . 1 7 . 1 . P . Seq 


F 


M00004609A:E09 


CH01COH 


102 


33329 


RTA00002S9 IF.p. 14. l.P.Seq 


F 


M00003787D:A10 


CH01COH 


103 


40652 


RTA00002896F. f.2 1. l.P.Seq 


F 


M00004153B:E03 


CH01COH 


104 


8070 


RTA00002925F.C.09. 1 P.Seq 


F 


M00039828B:H06 


CH09LNL 


105 


15880 


RTA000028S7F.2. 1 1 . 1 .P.Seq 


F 


M00001397CH0S 


CH01COH 


106 


87418 


RTA0000290 1 F.g. 10. 1 .P.Seq 


F 


M00005500A:D04 


CH02COH 


107 


9961 


RTA00002903F.1.2 1 - l.P.Seq 


F 


M00007046D:C09 


CH02COH 


108 


9966 


RTA000029 1 lF.o.20. l.P.Seq 


F 


M0OO27168B:HO8 


CH04MAL 


109 


17513 


RTA00002906F.d.03. 1 .P.Seq 


F 


M00021S96D:A05 


CH03MAH 


110 


24835 


RTA00002924F.f. 18. l.P.Seq 


F 


M00039554D:B09 


CH09LNL 


111 


15200 


RTA00002891F.j.08. l.P.Seq 


F 


M00003758B:F06 


CH01COH


112 


124098 


RTA00002905F.e.23. l.P.Seq 


F 


M00008020D:D05


CH03MAH 


113 


3786 


RTA00002901F.e.05. 1. P.Seq 


F 


M00005466C:B0I 


CH02COH 


114 


154121 


RTA00002906F.p.02. 1 .P.Seq 


F 


M00022083B:F10 


CH03MAH 


115 


5746 


RTA000029 18F.L09. l.P.Seq 


F 


M00032945D:B07 


CH08LNH 


116 


33700 


RTA0000290 1 F.e .09. t .P.Seq 


F 


M0000546SA:C04 


CH02COH 


117 


5660 


RTA00002S90F.b. 1 1 . 1 .P.Seq 


F 


M00001591B:H05 


CH01COH


118 


22732 


RTA00002924F.O.23. l.P.Seq 


F 


M00039785C:H12 


CH09LNL 


119 


14720 


RTA00002892F.j.05. 1 P.Seq 


F 


M00003825A:H10 


CH01COH


120 


13658 


RTA00002896F.C.04. 1 .P.Seq 


F 


M00004143B:B04 


CH01COH


121 


23150 


RTA00002S87F.a.O 1 . 1 .P.Seq 


F 


M00001384A:A07 


CH01COH


122 


11970 


RTA0OOO29O3F.k.03. l.P.Seq 


F 


M00007002C:A10 


CH02COH 


123 


10686 


RTA00002915F.p.02.2.P.Seq 


F 


M00032519D:F08 


CH08LNH


124 


9588 


RTA00002923F.m. 10. l .P.Seq 


F 


M00039331B:F09 


CH09LNL 


125 


8500 


RTA00002925F.e.2 1 . 1 .P.Seq 


F 


M00039360D:B02 


CH09LNL 


126 


8615 


RT A00002907 F. 1 . 1 7 . 1 .P.Seq 


F 


M0002223SC:G04 


CH03MAH


127 


7524 


RTA00002886F.e. 16. l.P.Seq 


F 


M00001348B:B03 


CH01COH


128 


325 


RTA000029 l2F.g.02. l.P.Seq 


F 


M00027332B:H09 


CH04MAL 


129 


10214 


RTA00002SS9F.O.0 1. l.P.Seq 


F 


M00001570A:B07 


CH01COH


130 


23534 


RTA00002S89F.C.22. l.P.Seq 


F 


M00001533D:A01 


CH01COH


131 


7473 


RTA00002893F.O.20. 1 .P.Seq 


F 


M00003965D:D11 


CH01COH


132 


185625 


RTA000029 12F.f. 10. 1 .P.Seq 


F 


M00027314D:E02 


CH04MAL 


133 


3920 


RTA000029 1 7F.m.07 . 1 .P.Seq 


F 


M00032773D:F08 


CH08LNH


134 


8458 


RTA000028S9F.m.02. 1 .P.Seq 


F 


M00001562D:B07 


CH01COH


135 


20263 


RTA00002906F.n.03. l.P.Seq 


F 


M00022073C:C07 


CH03MAH 


136 


186141 


RTA000029 l2F.f.04. 1 .P.Seq 


F 


M00027311A:H09 


CH04MAL 


137 


4852 


RTA000029 19F.L 15. 1 .P.Seq 


F 


M00033150B:E02 


CH08LNH


138 


2146 


RTA00002926F.a.2 1 . 1 .P.Seq 


F 


M00040061CC08 


CH09LNL 


139 


3522 


RTA00002S97F.h.23. l.P.Seq 


F 


M00004263C:D03 


CH01COH


140 


20027 


RTA00002909F.g.2 1 .l.P.Seq 


F 


M00022627B:H03 


CH03MAH


141 


15650 


RTA00002925F.g.04. l.P.Seq 


F 


M00039S74A:B06 


CH09LNL 


142 


21031 


RTA00OO29O 1 F.h.2 1 . 1 .P.Seq 


F 


M00005520B:H05 


CH02COH 


143 


95610 


RTA00002909F.h. 18. 1 .P.Seq 


F 


MO0022642A:GO8 


CH03MAH 


144 


903 


RTA000029 12F.O.03. 1 .P.Seq 


F 


M00027591A:E04 


CH04MAL 


145 


17284 


RT A0000 29 l6F.k.lS.l.P.Seq 


F 


M00032620B:F06 


CH08LNH


146 


15556 


RTA00002S95F. m.O 1 . 1 . P.Seq 


F 


M00004104A:A12 


CH01COH


147 


11013 


RTA00002S97F.b.OS. l.P.Seq 


F 


M00004214D:A05 


CH01COH


148 


15358 


RTA00002903F.n. IS. l.P.Seq 


F 


M0000709SA:E10 


CH02COH 


149 


10792 


RTA00002S94F.a. 10. l.P.Seq 


F 


M00003974C:E11 


CH01COH


150 


25507 


RTA000O29O 1 F.n.07. 1 .P.Seq 


F


M00005645D:A05


CH02COH


SEQ
ID 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


151 


19978 


RTA00002908F.k.2I.LP.Seq 


F 


M0002247~ , D'B01 


CH03MAH 

X— • 1 lv Jl* \i V X X 


152 


31133 


RT-\ 0000^905 F L.P Seq 

p ^ Mt * h Xx XX XX VX ^ai ^ X^ ^X a> w m * «^ * at * -a> v h^r 


F 


M00008045CA05 

J" T * XX XX XX XX XX xX * * # V XX «P 


CH03MAH 

X X W mJ I" 1 * aJ X 


153 


4503 


RTA00002893F h 14.1 P Sea 

■* X% M • • XX XX U XX 4p» XX X^ *X m * ■ f * a * * a\ ■- A - X^r X^ U 


F 


M00003906A C02 

* w & xx xx xx \y x xx xx t » » x_- xx 4# 


CH01COH


154 


9017 

j vy a ' 


RT=\0000^886F i 03 1 P Sea 

AY X A * Vx Vx ^x XX VX XX X,/ A * ■ VX «— ' t m * A * -* \-» XJ 


F 


MOOOOPoSA E08 

A ~ ■ xx xx xx Vx • <— / vy ■ • • *■ - Vx xx 


CH01COH 

x# 4 A vy A 4 A 


155 

x *x */ 


6635 

X/ Vy ,y »y 


RT AO00O2895F a.07. LP Sea 

<l v & 4 » vx vx xx v w -x *x * 1 * w ■ • 1 * * » ^x 


F 


M00004057DG01 

* r * Vy Xx x/ Vx " \X «X r 4_x ■ X.^ \X A 


CH01COH 

>w 4 4 Vx 4 Xw X*t 4 4 


156 

a Vy 


102^0 

a vy aw * vy 


RT A0000^9° 1 F d 09 LP Seq 


F 


M00033360CA03 

1 r \ Vx XX V ^ W V_^> * * * XX ^ 


CH09LNL 


157 


9831 

-y vy w 1 


RTA0000°896F.b 08. LP Seq 


F 


M00004139BFOI 

\T Mt V XX XX XX ■ A — ' -X * * XX & 


CH01COH 


158 

A mJ Vy 


17^91 


RTA0000^888F c 10 1 P Sea 

AX ■ £ h XX NX XX XX ^» XX XX XX A • V * • >X < * ' A * »fcX U 


F 


M00001442A:F08 

A ~ A XX v vy XX * ' * A h * * XX XX 


CH01COH 


159 


980^ 

VJ V/ w 


RTA0000^916F I P 1 P Sea 


F 


M0003' ? 6' :, 8C'B06 

a T a ^x v vy y vy vx x^ « ■ x xx xx 


CH08LNH 

ah a¥ xx XX a*^ ■ * t ■ 


160 


1338°8 

1. <_/ VX V vy 


RTA0000^908F b 'M I P Sea 


F 


M000' ? 2374CEl 1 


CH03MAH 


161 


25870 


RTA00002909F m ^3-LP.Seq 

1\ A i ft XX V XX V ^X x^ * " i ■ ■ • ^— ^X * * * * * ^X XX XJ 


F 


M000^2702D:E02 


CH03MAH 


16^ 

a \y*» 


97394 


RTA0000^9 PF i 2° 1 P Sea 

4 X m ■ w XX XX XX XX X' & 4 * 4 * 4* ^» » * • • 1 


F 


M0O027433B:D12 

A • a XX XX XX * W ^X ^X • -Bta^ * ^B* 


CH04MAL 


163 


98159 


RTA000029 1 OF f 19 1 P Sea 

1\ A A % XX XX V XX ^» X' 4 XX MV *■ ^ ■ « X^ 


F 


M0O022897B F06 

4 r * XX XX XX *v XX # t^gm * * 


CH03MAH 


164 

X x/~ 


^P64 


RTA0000 7 903F d 15 1 PSea 


F 


M00006904D-ACP 

it 1 xx vy w vy vy x xx t l » • ^ v ^ 


CH02COH 

^taX XX ar A a* 


165 


186199 

1 Vy Vy ± J J 


RTA0000'>91 IF b 13 0 P Sea 

i\ W 4 * Vx V* Vx \J m* <f A A A. ■ V» A> ^x * ^» • A * x-r VjXxJ 


F 


M000'>3394DD10 

4 T ■> xx VX xX 4w x ■ m « a xx 


CH04MAL 


166 


28794 


RTA0000^887F i 14 1 PSea 

1\ X 4 L\f/X> Vx Vx ^ Vx Vx > X * 1 » A T * A * i • ^VU 


F 

jA 


M00001403CB03 

A v A XX XX Vy Vy A ■ XX im* Xw r A—' VX -X 


CH01COH 

x^ A 4h VX A X^v X^X Mi av 


167 


^3 180 


RTA0000' 7 895F i 17 1 P Sea 


F 


M00004093A C03 

A T A XX Vy Vx Vx r Vx -X ^ A \ * X^ XX 


CH01COH 

Xrf" A * VX * Vr V^x 4 H 


168 


? 1099 


RTA00007Q99P 1 01 1 P Sea 


F 


M000391 1 1ACP 

at a vy vy vy »y ^ & a a j a + v* a _ 


CH09LNL 

v# X A Vx X A 




1 <H70 


RTA00007893F i 17 1 P Sea 


F 

X 


M0000391 ICA09 


CH01COH 

v«#- a a vy A V_» x^/ & x 


170 


dROd 


RTA0000' , 9 1SF a 19 1 PSea 


F 


M0003' :, 8' 1 6CD10 

1 ▼ a vy vy u j vy v_- « a-x a vy 


CH08LNH 

Xw 4 A Vy Vx »—f 4 ^ A A 


17 1 


7066 

/ WW 


RTA0000^9 19F 0 07 I P Sea 

Xx 1 ik v/V> Vx Vx x ^x 1 yi »\v » vy ' • I <1 • h^VU 


F 

A 


M00033°46AHP 

*▼ X\y Vy v «/ *m~vri -AAA-* 


CH08LNH 

Xw a xvy vy a> i ^ x x 


17^ 


48997 


RTA0000^903F 0 18 I P Sea 


F 

A 


M000071 17A CI I 

a~ a vy vy vy vy # a a r t v ■ a a 


CH02COH 

x^ a a vy x^^ v^y a av 


173 


20171 
ill 


RTA0000' ? S86F i 15 1 P Sea 

i\ 1 i V X/Vx W toU U Vx A -l*L*x*L*A • <^X- vJ 


F 

1 


M0000 1359A H 10 

atx vy vy vy vy a ^ ✓ * v - a a a vy 


CH01COH 

X>w a. * Vx A ^- -* Xj^ & 


174 


1UJJJ 


RTA0000' , S94F d 10 1 P Sea 


F 

A 


M00004055CB 10 

1T1 vy vy vy vy^r vy ^ ^ \x * a,* a vy 


CH01COH 

X^ A A XX IV Xsrf- v^ A A 


175 


PS9'? 


RTA0000°914F m 08 1 P Sea 

XX 1 AUV vy W» y 1 "I -III- Vy Vx - a - 1 *^JV^u 


F 

A 


M000°8361B H08 

i*i vv/v/«> vy *y vy 1 a*/ • l 1 uu 


CH08LNH 

Xrf » A VX Vx *— » i 1 1 * 


176 


^. .j f \j 1 


RTA0000°S96F i ^ 1 1 P Sea 


F 

A 


M00004171BB0^ 

a 9 a vy vy vy vy a $ a a>x > a^x vy ^ 


CH01COH 

X^K~ * * XX tk. Xb^i X^r m a¥ 


177 


16849 


RTAOO00' 7 91SF b 07 1 P Sea 


F 


M0003^S n 9D- A05 

x ▼ a vy vy vy vy ✓ a^x . * * vx -x 


CH08LNH 

X — . 4± S VX XX Abrf * * A A 


178 


185866 


RTAOO0029 1 1 F c 1 8 0 P Sea 

fx 1 r \ vy Vx vy vy ^/ 1 11 ■ w< 1 UfM»i +iJv*u 


F 

A 


M000°681SCE01 

it iv/v/v/w vy vy a vx x^- * Aw vy x 


CH04MAL 

X^ A A Xx ViW 


179 


90097 


RTA0000°899F b ^0 1 P Sea 

lx 1 0 vvv v .« (y y x 7 i •Lx»_*vy- 1 » &, *xJv^vt 


F 

A 


M00004443C F07 

a w 1 vy vy Vx vy • r » _> x^ » * vy * 


CH01COH 


180 


21975 

I17/J 


RTA0000^90' ? F a 01 1 P Sea 


F 

A 


M00005743D' AP 

A ▼ a vy vy vy vy ^ / ^t+j a^ * * » a — 


CHO^COH 

Vv & 1V*« Vr V^ A A 


18 1 




RTA00007903F b °0 1 P Sea 

IX A i LvvVvw / vyi » vx - — vy - a • A .UvU 


F 

A 


M00006S77C Fl 1 

a T a Vx vy vy vy w vy ' * x^ * a a ■ 


CH02COH 


182 


6034 


RT ^0000^90 IF a P 1 PSea 

fx l *+■ vy vy vy vy « ^ vy a 1 • • i _ • a » a * vj 


F 

A 


M000054^ 1ACI 1 

A * A vy vx vy vy r *w v ' 1 v • x™» * * 


CH02COH 


183 


1 1 "*67 

1 1 «7U_ 


RTAO0007S87F h 06 1 P Sea 


F 

X 


M00001399C- <\01 

x~ a vy vy vy vy \. -^s s * v, < • > vy a 


CH01COH 

Xv * A XX M V^ M & 


184 


20671 


RTA0000^905F a 00 1 P Sea 


F 


M00007947AB06 

ata vy w vy \j * x ▼ # i a » A-X vx v* 


CH03MAH 


185 

X Ly*y 




RTA0000' , 9 17F b C 7 1 P Sea 

Xx X i\.\J\J\J\J — • -y 1 / x • vy* * x • a • t^y v«- vj 


F 


M0003' 7 67 1BD06 

i~ a vy vy vy y vy # a a^ ■ a*x vy vy 


CH0SLNH 

X^. a. 4 XX XX ■ ■ ^ T a¥ -av 


186 




RTA00007897F d 1 1 1 P Sea 


F 

A 


M00004' 7 ^9BB06 

a vy vy vy vy ~ H _ ^ i_y . A-y vy vy 


CH01COH 

v< a a vy a x>y a a 


187 

1 o / 




RTA0000^S97F d^O I P Sea 


F 

X 


M00004' > 30DB05 

l~ l vy vy vy Vy ~ ^ _/ vy 4_y . xy vy *y 


CH01COH 

X^ A A Vx A X* w A A 


188 




RTA0000^9 15F i ^ 1 1 P Sea 


F 

A 


M000314I6DH05 

at a vy vy vy \ » 1 vy x-y - a a vy -x 


CH0SLNH 

X^ A *> XX XX * * * * 


1 89 


4747 


RT AOOOO^Q 19F c ^3 1 PSea 


F 


M0003304 1 A B I I 

1 t 1 \y vy vy *y ^ vy t 1 . v . xy a a 


CH0SLNH 

Vx x x vy Vx 1-^ A ' X A 




94S19 


RTA0000°9 19F m 16 1 P Sea 


F 

A 


M00033 9 1SCF07 

!▼ a vy vy vy^y 1 v v * a vy ' 


CH0SLNH 

X_> 4 A Vx vy * — * * » 4 ■ 


191 


8576 


RTA00002890F.h. 13. LP.Seq 


F 


M00001616D:F03 


CH01COH 


192 


12056 


RTA00002893F.g.l2.l.P.Seq 


F 


M000039()0C:D12 


CH01COH 


193 


895 


RTA0000292 lFb. 1 1. LP.Seq 


F 


M00033303C.F09 


CH09LNL 


194 


7212 


RTA00002S97F.J.04. LP.Seq 


F 


M00004270A.E09 


CH0ICOH 


195 


108296 


RTA00002907F.h.20. 1 P.Seq 


F 


M00022193C:C09 


CH03MAH 


196 


115713 


RTA00002906F.a.22. 1 .P.Seq 


F 


M00021S52C:H02 


CH03MAH 


197 


7334 


RTA000029 10F.1.08. 1 .P.Seq 


F 


M00023004C:A0l 


CH03M.AH 


198 


1090 


RTA000029 18F.g.20.2.P.Seq 


F 


M00032S92C:C12 


CHOSLN'H 


199 


7913 


-RTA00002SS6F.]. 13. 1 .P.Seq 


F 


M00001362A.F09 


CH01COH 


200 


12139 


RTA00002923F.O.02. 1 .P.Seq 


F 


M00039349D:B11 


CH09LNT 
SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
201 | 17148 | RTA00002895F.l.14.1.P.Seq | F | M00004102A:E03 | CH01COH
202 | 2314 | RTA00002S9 1 F. 10. 1 .P.Seq | F | M00001771B:E06 | CH01COH
203 | 18660 | RTA00002903F.m.03.1.P.Seq | F | M00007048C:A12 | CH02COH
204 | 21799 | RTA00002900F.n.09.1.P.Seq | F | M00005385D:F07 | CH02COH
205 | 16612 | RTA00002893F.j.17.1.P.Seq | F | M00003915C:D10 | CH01COH
206 | 168067 | RTA00002909F.a.17.1.P.Seq | F | M00022537B:C06 | CH03MAH
207 | 21197 | RTA00002891F.c.19.1.P.Seq | F | M00001680A:A01 | CH01COH
208 | 45015 | RTA00002905F.o.17.1.P.Seq | F | M00021678D:H04 | CH03MAH
209 | 15381 | RTA00002903F.m.19.1.P.Seq | F | M00007070C:C01 | CH02COH
210 | 1402 | RTA00002896F.i.04.1.P.Seq | F | M00004166C:B10 | CH01COH
211 | 69026 | RTA00002893FX05.1.P.Seq | F | M00003887C:E09 | CH01COH
212 | 33119 | RTA00002922F.L07.1.P.Seq | F | M00039067A:C05 | CH09LNL
213 | 1166 | RTA00002891F.c.05.1.P.Seq | F | M00001677B:H08 | CH01COH
214 | 14345 | RTA00002891F.b.06.1.P.Seq | F | M00001671C:F03 | CH01COH
215 | 10589 | RTA00002908F.d.16.1.P.Seq | F | M00022392B:F01 | CH03MAH
216 | 13281 | RTA00002916F.p.01.1.P.Seq | F | M00032652C:C07 | CH08LNH
217 | 13281 | RTA00002916F.o.24.1.P.Seq | F | M00032652C:C07 | CH08LNH
218 | 12248 | RTA00002898F.h.08.1.P.Seq | F | M00004351B:G07 | CH01COH
219 | 164955 | RTA00002909F.n.09.1.P.Seq | F | M00022706D:G03 | CH03MAH
220 | 15686 | RTA00002907F.f.24.1.P.Seq | F | M00022170C:C01 | CH03MAH
221 | 15686 | RTA00002907F.g.01.1.P.Seq | F | M00022170C:C01 | CH03MAH
222 | 34228 | RTA00002891F.d.14.1.P.Seq | F | M00001633B:F11 | CH01COH
223 | 6530 | RTA00002900F.p.04.1.P.Seq | F | M00005409D:B02 | CH02COH
224 | 1964 | RTA00002903F.c.10.1.P.Seq | F | M00006835A:F07 | CH02COH
225 | 10547 | RTA00002893F.e.16.1.P.Seq | F | M00003884A:E12 | CH01COH
226 | 13393 | RTA00002901F.a.06.1.P.Seq | F | M00005422B:B08 | CH02COH
227 | 14309 | RTA00002915F.e.03.2.P.Seq | F | M00028773C:C05 | CH08LNH
228 | 46850 | RTA00002907F.e.21.1.P.Seq | F | M00022158B:B09 | CH03MAH
229 | 5398 | RTA00002911F.i.13.1.P.Seq | F | M00027004C:C11 | CH04MAL
230 | 27569 | RTA00002910F.l.14.1.P.Seq | F | M00023007D:D03 | CH03MAH
231 | 26277 | RTA00002898F.g.19.1.P.Seq | F | M00004347C:A05 | CH01COH
232 | 185914 | RTA00002912F.k.01.1.P.Seq | F | M00027467A:C07 | CH04MAL
233 | 14274 | RTA00002895F.c.09.1.P.Seq | F | M00004066D:G10 | CH01COH
234 | 28396 | RTA00002907F.g.02.1.P.Seq | F | M00022171A:F03 | CH03MAH
235 | 22991 | RTA00002916F.k.07.1.P.Seq | F | M00032614C:B10 | CH08LNH
236 | 6321 | RTA00002908F.b.17.1.P.Seq | F | M00022372D:H12 | CH03MAH
237 | 21322 | RTA00002903F.a.17.1.P.Seq | F | M00006861D:H10 | CH02COH
238 | 8440 | RTA00002923F.k.09.1.P.Seq | F | M00039302B:E10 | CH09LNL
239 | 14677 | RTA00002905F.c.05.1.P.Seq | F | M00007975D:F12 | CH03MAH
240 | 135005 | RTA00002902F.d.23.1.P.Seq | F | M00006592A:A12 | CH02COH
241 | 5091 | RTA00002895F.n.19.1.P.Seq | F | M00004115A:G12 | CH01COH
242 | 24760 | RTA00002923F.h.13.1.P.Seq | F | M00039270D:D02 | CH09LNL
243 | 21333 | RTA00002926F.t*.22.2.P.Seq | F | M00040123C:A10 | CH09LNL
244 | 12176 | RTA00002886F.g.13.1.P.Seq | F | M00001353A:H07 | CH01COH
245 | 14407 | RTA00002901F.c.20.1.P.Seq | F | M00005452D:E05 | CH02COH
246 | 186319 | RTA00002912F.d.24.1.P.Seq | F | M00027290C:F06 | CH04MAL
247 | 30135 | RTA00002907F.i.21.2.P.Seq | F | M00022208C:F08 | CH03MAH
248 | 33142 | RTA00002901F.l.01.1.P.Seq | F | M00005607B:C04 | CH02COH
249 | 33142 | RTA00002901F.k.24.1.P.Seq | F | M00005607B:C04 | CH02COH
250 | 16232 | RTA00002900F.n.10.1.P.Seq | F | M00005387A:B03 | CH02COH



SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
251 | 23954 | RTA00002894F.k.06.1.P.Seq | F | M00004036B:A11 | CH01COH
252 | 12399 | RTA00002918F.n.10.1.P.Seq | F | M00032985D:G09 | CH08LNH
253 | 30853 | RTA00002907F.j.16.2.P.Seq | F | M00022216D:D10 | CH03MAH
254 | 8615 | RTA00002907F.l.17.2.P.Seq | F | M00022238C:G04 | CH03MAH
255 | 142359 | RTA00002905F.c.10.1.P.Seq | F | M00007980B:A07 | CH03MAH
256 | 9565 | RTA00002926F.g.08.2.P.Seq | F | M00040127C:D02 | CH09LNL
257 | 17334 | RTA00002902F.d.03.1.P.Seq | F | M00006582D:A09 | CH02COH
258 | 12540 | RTA00002886F.f.03.1.P.Seq | F | M00001349C:B04 | CH01COH
259 | 17289 | RTA00002926FT.14.2.P.Seq | F | M00040113D:C05 | CH09LNL
260 | 46798 | RTA00002907F.i.13.2.P.Seq | F | M00022202C:C04 | CH03MAH
261 | 7797 | RTA00002905F.b.06.1.P.Seq | F | M00007953D:F07 | CH03MAH
262 | 1945 | RTA00002896F.p.04.1.P.Seq | F | M00004201D:C03 | CH01COH
263 | 6084 | RTA00002896F.o.02.1.P.Seq | F | M00004195A:F07 | CH01COH
264 | 6091 | RTA00002930F.c.03.1.P.Seq | F | M00042915B:G11 | CH15CON
265 | 186105 | RTA00002930F.c.10.1.P.Seq | F | M00055430A:A01 | CH15CON
266 | 11341 | RTA00002930F.h.07.1.P.Seq | F | M00055961C:B10 | CH15CON
267 | 2520 | RTA00002930F.e.10.1.P.Seq | F | M00055639A:E06 | CH15CON
268 | 136735 | RTA00002903F.k.06.1.P.Seq | F | M00007006C:C12 | CH02COH
269 | 8336 | RTA00002900F.e.20.1.P.Seq | F | M00004873B:G04 | CH02COH
270 | 13926 | RTA00002907F.h.19.1.P.Seq | F | M00022193B:A09 | CH03MAH
271 | 11119 | RTA00002906F.k.01.1.P.Seq | F | M00022009C:A08 | CH03MAH
272 | 11119 | RTA00002906F.j.24.1.P.Seq | F | M00022009C:A08 | CH03MAH
273 | 11726 | RTA00002906F.l.07.1.P.Seq | F | M00022051B:D07 | CH03MAH
274 | 6799 | RTA00002925F.g.21.1.P.Seq | F | M00039885C:D11 | CH09LNL
275 | 17266 | RTA00002889F.g.09.1.P.Seq | F | M00001544B:B05 | CH01COH
276 | 9479 | RTA00002924F.g.04.1.P.Seq | F | M00039560B:G09 | CH09LNL
277 | 185557 | RTA00002912F.j.13.1.P.Seq | F | M00027457B:E11 | CH04MAL
278 | 27872 | RTA00002906F.e.14.1.P.Seq | F | M00021923D:H02 | CH03MAH
279 | 15513 | RTA00002924F.g.21.1.P.Seq | F | M00039617C:A10 | CH09LNL
280 | 4446 | RTA00002891F.m.15.1.P.Seq | F | M00003773A:F10 | CH01COH
281 | 1681 | RTA00002916F.g.07.1.P.Seq | F | M00032577D:F01 | CH08LNH
282 | 24243 | RTA00002887F.n.13.1.P.Seq | F | M00001424D:D02 | CH01COH
283 | 16049 | RTA00002900F.c.11.1.P.Seq | F | M00004846A:A10 | CH02COH
284 | 186267 | RTA00002910F.h.11.1.P.Seq | F | M00022924B:A05 | CH03MAH
285 | 4543 | RTA00002925F.h.22.1.P.Seq | F | M00039895D:C04 | CH09LNL
286 | 6176 | RTA00002914F.d.23.1.P.Seq | F | M00028188C:H11 | CH08LNH
287 | 29043 | RTA00002906F.h.17.1.P.Seq | F | M00021974D:F01 | CH03MAH
288 | 696 | RTA00002922F.o.15.1.P.Seq | F | M00039143A:F04 | CH09LNL
289 | 7225 | RTA00002891F.l.22.1.P.Seq | F | M00003770C:A10 | CH01COH
290 | 25609 | RTA00002899F.h.15.1.P.Seq | F | M00004608A:H04 | CH01COH
291 | 6295 | RTA00002922F.o.24.1.P.Seq | F | M00039U6B:G04 | CH09LNL
292 | 186319 | RTA00002912F.e.01.1.P.Seq | F | M00027290C:F06 | CH04MAL
293 | 4539 | RTA00002889F.d.04.1.P.Seq | F | M00001534C:E07 | CH01COH
294 | 17841 | RTA00002891F.m.06.1.P.Seq | F | M00003771D:A03 | CH01COH
295 | 13720 | RTA00002924F.c.05.1.P.Seq | F | M00039430A:E04 | CH09LNL
296 | 7300 | RTA00002925F.a.14.1.P.Seq | F | M00039806B:D05 | CH09LNL
297 | 136280 | RTA00002912F.f.13.1.P.Seq | F | M00027316C:C03 | CH04MAL
298 | 185585 | RTA00002912F.n.04.1.P.Seq | F | M00027569A:E05 | CH04MAL
299 | 3447 | RTA00002900F.U 1. l.P.Seq | F | M00005364B:E10 | CH01COH
300 | 14487 | RTA00002889F.f.19.1.P.Seq | F | M00001542B:F09 | CH01COH
SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
301 | S118 | RTA0000 n 90lF s 16 1 P Sea | F | M00005505A:F01 | CH02COH
302 | 7766 | RTA00002917F.e.07.1.P.Seq | F | M00032700A:E09 | CH08LNH
303 | 7450 | RTA0000" , 899F * 09 1 P Sea | F | M00004509D:C06 | CH01COH
304 | 1 5369 | RTA00002908F.m.18.1.P.Seq | F | M00022494D:A05 | CH03MAH
305 | 4954 | RTAOOOO^^F i 17 LP Sea | F | M00033150C:A11 | CH08LNH
306 | 1 7 1 89 | RTAOOOO^OOF r. 1 1 . 1 .P Sea | F | M00005333D:D08 | CH02COH
307 | 1 R6561 | RTAOOOO^PF m°3.1 PSea | F | M00027549C:G03 | CH04MAL
308 | Aa6.A< | RT A000CPS96F h 2°. 1 P Sea | F | M00004165C:A11 | CH01COH
309 | 1 1 TUT | RT A 0000° 9"^ a 24. 1 .P Sea | F | M00039413C:E06 | CH09LNL
310 | 1S° 1 7 | RTA0000"'893F mAl P Sea | F | M00003942A:D01 | CH01COH
311 | 77099 | RTA0000°S90F m 09 J P Sea | F | M00001648A:D10 | CH01COH
312 | 7S04 1 | RT A0000^S90F dP IP Sea | F | M00001661D:F06 | CH01COH
313 | 18S918 | RTA0000^91 IF.d 01 LP Sea | F | M00027173C:E11 | CH04MAL
314 |  | RT AOOOO^QOSF 0 06 1 P Sea | F | M000' , 2509B:Dll | CH03MAH
315 | 1 RS707 | RTA00002911F.o.19.1.P.Seq | F | M000 >> 7167C:B10 | CH04MAL
316 |  | RT A000TP9PF n 19 1 P Sea | F | M000275S6A:C09 | CH04MAL
317 | 7S70J. | RTA0000^9 PFn^ IP Sea | F | M000°7589B:G07 | CH04MAL
318 | 7 1 0AQ | RTA00007R96F h IS 1 P Sea | F | M00004I64BEP | CH01COH
319 | i Li /in | RT A 000079 1 7F e IS 1 PSea | F | M0003771 1 BF01 | CH08LNH
320 |  | RTAOOOO^^IF i IS 1 P Sea | F | M00039°8'>BG04 | CH09LNL
321 | 7 11Q1 | RT A0000^S96F 0 01 I P Sen | F | M0000415SD:E0S | CH01COH
322 | 0/JJ | RT A0000^9I SF 1 01 1 P Sea | F | M0003^944 -\*B07 | CH08LNH
323 |  | RT A 0000^907 F r ^>1 1 P Sen | F | MOOO^^^ISB BP | CH03MAH
324 | 8 1 no | RT A0000' , S96F 0 ">1 1 P Sea | F | M00004160D:F06 | CH01COH
325 |  | RTA00002919F.i.18.1.P.Seq | F | M000331S3B:F10 | CH08LNH
326 |  | RTA0000°90°F k °3 1 P Sea | F | M00006743A:D04 | CH02COH
327 | O-7J | RTAOOOO°903F k ^3 1 P Sea | F | M00007031A:E02 | CH02COH
328 | t SA777 | RT AOOOO" , 907F 1 13 1 P Sea | F | M00022237D:D06 | CH03MAH
329 | 777S I | RTA»^O0O°S97F I 15 1 P Sea | F | M000042S2C:A12 | CH01COH
330 | 7Q£G | RT AOOOO^O 1 7F i 15 1 P Sea | F | M00032"49D:G01 | CH08LNH
331 | I JOUU7 | RT A0000^907F k 05 1 P Sea | F | M000' > ' ,>> 20A:A07 | CH03MAH
332 | 0/1^1 | RT AOOOO°907F k ">1 1 P Sea | F | M0002222SB:BU | CH03MAH
333 | 1 ^6ns7 | RTAOOOO^OPF h OS 1 P Sea | F | M000' 7 7364B:E12 | CH04MAL
334 | 007 | RT\0O0O' > 917F f ^ 1 PSea | F | M00032723D:H02 | CH08LNH
335 | 1 1 609 | RT \0000'>S99F f ^3 1 P Sea | F | M00004507D:E03 | CH01COH
336 | 1 R607 S | RTA00002911F.k.19.1.P.Seq | F | M00027057C:D10 | CH04MAL
337 |  | RTA0O0O°9 1 1 F 1 *>0 1 P Sea | F | M000270S1A:A0S | CH04MAL
338 | 1 tain | RT AOO00^S9^F e 07 ^ P Sea | F | MOOOO3S08B:E0 T | CH01COH
339 | I0J7J0 | RT AOOOO^O 1 1 F 0 ^4 I P Sea | F | M000°7173C:Etl | CH04MAL
340 |  | RTAOOOO^OISF m 15 1 P Sea | F | M00032497D:B10 | CH08LNH
341 | 186588 | RTA00002911F.L03.1.P.Seq | F | M00027064B:D06 | CH04MAL
342 | 23174 | RTA00002909F.c.17.1.P.Seq | F | M00022600D:B05 | CH03MAH
343 | 4727 | RTA00002905F.g.19.1.P.Seq | F | MO0O0SO59D:30S | CH03MAH
344 | 17048 | RTA00002887F.L10.1.P.Seq | F | M00001416B:A05 | CH01COH
345 | 2354 | RTA00002916F.o.03.1.P.Seq | F | M00032645D:C01 | CH08LNH
346 | 1987 | RTA00002894F.a.13.1.P.Seq | F | M00003974D;E0: | CH01COH
347 | 24483 | RTA00002897F.i.21.1.P.Seq | F | M00004269A:G11 | CH01COH
348 | 33337 | RTA00002896F.t'.08.1.P.Seq | F | M00004155A:H03 | CH01COH
349 | 11641 | RTA00002916F.m.19.1.P.Seq | F | M00032637A:F09 | CH08LNH
350 | 10307 | RTA00002910F.l.01.1.P.Seq | F | M00022995C:G07 | CH03MAH



SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
351 | 20388 | RTA00002906F.a.04.1.P.Seq | F | M00021700D:H03 | CH03MAH
352 | 24687 | RTA00002903F.m.02.1.P.Seq | F | M00007048B:E11 | CH02COH
353 | 10414 | RTA00002919F.n.19.1.P.Seq | F | M00033232B:C08 | CH08LNH
354 | 11058 | RTA00002892F.h.16.2.P.Seq | F | M00003820B:F11 | CH01COH
355 | 6574 | RTA00002917F.o.17.1.P.Seq | F | M00032797D:D08 | CH08LNH
356 | 18782 | RTA00002905F.f.07.1.P.Seq | F | M00008021C:G12 | CH03MAH
357 | 35896 | RTA00002896F.d.04.1.P.Seq | F | M00004146C:B04 | CH01COH
358 | 3518 | RTA00002930F.j.10.1.P.Seq | F | M00056217D:E10 | CH15CON
359 | 8820 | RTA00002915F.f.17.2.P.Seq | F | M00028782A:F01 | CH08LNH
360 | 10208 | RTA00002897F.h.08.1.P.Seq | F | M00004251D:D03 | CH01COH
361 | 2089 | RTA00002896F.g.14.1.P.Seq | F | M00004159D:F12 | CH01COH
362 | 170919 | RTA00002909F.p.03.1.P.Seq | F | M00022727A:G01 | CH03MAH
363 | 8727 | RTA00002917F.o.02.1.P.Seq | F | M00032791B:H11 | CH08LNH
364 | 33184 | RTA00002898F.d.08.1.P.Seq | F | M00004324A:D10 | CH01COH
365 | 27973 | RTA00002905F.g.13.1.P.Seq | F | M00008055D:G03 | CH03MAH
366 | 15835 | RTA00002897F.k.13.1.P.Seq | F | M00004278C:B10 | CH01COH
367 | 10273 | RTA00002903F.n.03.1.P.Seq | F | M00007081B:E09 | CH02COH
368 | 283° | RTA00002899F.f.03.1.P.Seq | F | M00004502A:D12 | CH01COH
369 | *J ^\J~m — | RT A00002903 F.m.P.l.P.Seq | F | M00007060D:G07 | CH02COH
370 | 68176 | RTA00002893F.g.11.1.P.Seq | F | M00003898C:A01 | CH01COH
371 | 29378 | RTA00002915F.n.14.2.P.Seq | F | M00032508A:E03 | CH08LNH
372 | 23235 | RTA00002925F.k.02.1.P.Seq | F | M00039929B:E06 | CH09LNL
373 | Pill | RTA00002895F.o.17.1.P.Seq | F | M00004122C:D01 | CH01COH
374 | 5737 | RTA00002924F.k.02.1.P.Seq | F | M00039672C:D05 | CH09LNL
375 | 7^475 | RTA00002915F.l.15.1.P.Seq | F | M00032490D:E08 | CH08LNH
376 | 70^7 | RTA00002907F.o.01.1.P.Seq | F | M00022264A:B02 | CH03MAH
377 | 17165 | RTA00002903F.d.19.1.P.Seq | F | M00006907A:C09 | CH02COH
378 | 26446 | RTA00002894F.m.17.1.P.Seq | F | M00004047C:B09 | CH01COH
379 | 6755 | RTA00002918F.k.24.1.P.Seq | F | M00032944A:B07 | CH08LNH
380 | 9336 | RTA00002909F.n.02.1.P.Seq | F | M00022703D:B11 | CH03MAH
381 | 6960 | RTA00002916F.o.08.1.P.Seq | F | M00032647B:F06 | CH08LNH
382 | 472 | RTA00002911F.g.01.1.P.Seq | F | M00026936D:C07 | CH04MAL
383 | 9460 | RTA00002908F.c.03.1.P.Seq | F | M00022376D:D05 | CH03MAH
384 | 10307 | RTA00002910F.k.24.1.P.Seq | F | M00022995C:G07 | CH03MAH
385 | 46^3 | RTA00002923F.d.22.1.P.Seq | F | M00039222B:A04 | CH09LNL
386 | 141167 | RTA00002905F.c.09.1.P.Seq | F | M00007980A:B01 | CH03MAH
387 | 34011 | RTA00002898F.m.10.1.P.Seq | F | M00004385C:H12 | CH01COH
388 | 5965 | RTA00002915F.a.07.1.P.Seq | F | M00028620C:C07 | CH08LNH
389 | P336 | RTA00002915F.g.04.1.P.Seq | F | M00028784A:D12 | CH08LNH
390 | 36492 | RTA00002893F.f.18.1.P.Seq | F | M00003891B:H02 | CH01COH
391 | 29803 | RTA00002908F.k.06.1.P.Seq | F | M00022467D:B03 | CH03MAH
392 | 4420 | RTA00002920F.a.15.1.P.Seq | F | M00033326B:B05 | CH08LNH
393 | 15097 | RTA00002923F.b.06.1.P.Seq | F | M00039175A:F01 | CH09LNL
394 | 19133 | RTA00002894F.g.03.1.P.Seq | F | M00003993C:D07 | CH01COH
395 | 9810 | RTA00002905F.c.03.1.P.Seq | F | M00007975C:A10 | CH03MAH
396 | 31562 | RTA00002897F.3.09.1.P.Seq | F | M00004210A:A03 | CH01COH
397 | 1499 | RTA00002912F.k.12.1.P.Seq | F | M00027475D:A01 | CH04MAL
398 | 29531 | RTA00002907F.o.05.1.P.Seq | F | M00022265A:F11 | CH03MAH
399 | 4287 | RTA00002918F.j.20.1.P.Seq | F | M00032928C:D02 | CH08LNH
400 | 28660 | RTA00002905F.p.11.1.P.Seq | F | M00021690A:C03 | CH03MAH
SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
401 | 4596 | RTA00002898F.f.21.1.P.Seq | F | M00004341C:E05 | CH01COH
402 | 21774 | RTA00002898F.c.20.1.P.Seq | F | M00004322B:D03 | CH01COH
403 | 5611 | RTA00002915F.e.12.1.P.Seq | F | M00028774D:E10 | CH08LNH
404 | 7030 | RTA00002894F.l.13.1.P.Seq | F | M00004042B:A11 | CH01COH
405 | 11736 | RTA00002898F.c.09.1.P.Seq | F | M00004330A:A01 | CH01COH
406 | 94732 | RTA00002910F.e.17.1.P.Seq | F | M00022856D:A07 | CH03MAH
407 | 30283 | RTA00002923F*.19.1.P.Seq | F | M00039255D:B01 | CH09LNL
408 | 129779 | RTA00002904F.a.18.1.P.Seq | F | M00007155C:D07 | CH02COH
409 | 4635 | RTA00002900F.j.21.1.P.Seq | F | M00005349C:C02 | CH02COH
410 | 5879 | RTA00002893F.f.08.1.P.Seq | F | M00003888B:F09 | CH01COH
411 | 119206 | RTA00002905F.m.16.1.P.Seq | F | M00021650D:A11 | CH03MAH
412 | 6946 | RTA00002930F.g.19.2.P.Seq | F | M00055880B:H10 | CH15CON
413 | 42462 | RTA00002902F.f.12.1.P.Seq | F | M00006631D:D02 | CH02COH
414 | 24285 | RTA00002896F.m.17.1.P.Seq | F | M00004189A:C12 | CH01COH
415 | 13769 | RTA00002901F.a.17.1.P.Seq | F | M00005423C:D07 | CH02COH
416 | 17039 | RTA00002896F.i.14.1.P.Seq | F | M00004169A:E04 | CH01COH
417 | 14397 | RTA00002896F.j.11.1.P.Seq | F | M00004172D:B12 | CH01COH
418 | 14351 | RTA00002888F.c.21.1.P.Seq | F | M00001444C:D11 | CH01COH
419 | 5579 | RTA00002893F.j.11.1.P.Seq | F | M00003914A:A08 | CH01COH
420 | 24186 | RTA00002914F.n.02.1.P.Seq | F | M00028366B:B08 | CH08LNH
421 | 11433 | RTA00002921F.c.06.1.P.Seq | F | M00033342B:F03 | CH09LNL
422 | 186635 | RTA00002911F.t*.06.1.P.Seq | F | M00026907D:E07 | CH04MAL
423 | 5955 | RTA00002915F.d.18.1.P.Seq | F | M00028771A:E02 | CH08LNH
424 | 22053 | RTA00002894F.k.09.1.P.Seq | F | M00004036D:C12 | CH01COH
425 | 9259 | RTA00002918F.b.09.1.P.Seq | F | M00032830D:D02 | CH08LNH
426 | 25437 | RTA00002905F.o.23.1.P.Seq | F | M00021681C:C09 | CH03MAH
427 | 8488 | RTA00002916F.i.02.1.P.Seq | F | M00032590B:H01 | CH08LNH
428 | 4884 | RTA00002919F.o.12.1.P.Seq | F | M00033248D:H11 | CH08LNH
429 | 9804 | RTA00002915F.e.19.1.P.Seq | F | M00028764B:D03 | CH08LNH
430 | 179954 | RTA00002910F.j.04.1.P.Seq | F | M00022964A:B03 | CH03MAH
431 | 186532 | RTA00002912F.a.01.1.P.Seq | F | M00027189C:B10 | CH04MAL
432 | 11015 | RTA00002894F.L15.1.P.Seq | F | M00004029D:A01 | CH01COH
433 | 8824 | RTA00002903F.b.17.1.P.Seq | F | M00006877B:C09 | CH01COH
434 | 4063 | RTA00002916F.k.01.1.P.Seq | F | M00032613A:E11 | CH08LNH
435 | 7964 | RTA00002896F.i.18.1.P.Seq | F | M00004170A:F03 | CH01COH
436 | 9238 | RTA00002915F.j.20.1.P.Seq | F | M00032473B:A03 | CH08LNH
437 | 2841 | RTA00002914F.t*.15.1.P.Seq | F | M00028196A:G03 | CH08LNH
438 | 11203 | RTA00002886F.p.16.1.P.Seq | F | M00001382D:H08 | CH01COH
439 | 8800 | RTA00002888F.c.20.1.P.Seq | F | M00001444B:E04 | CH01COH
440 | 3224 | RTA00002916F.d.23.1.P.Seq | F | M00032556D:A03 | CH08LNH
441 | 95423 | RTA00002909F.k.24.1.P.Seq | F | M00022674C:H08 | CH03MAH
442 | 7911 | RTA00002926F.c.11.2.P.Seq | F | M00040079D:D09 | CH09LNL
443 | 88052 | RTA00002925F.p.11.1.P.Seq | F | M00040041D:F01 | CH09LNL
444 | 32736 | RTA00002900F.L20.1.P.Seq | F | M00005367D:A11 | CH01COH
445 | 20811 | RTA00002896F.n.14.1.P.Seq | F | M00004192C:B06 | CH01COH
446 | 12856 | RTA00002908F.b.07.1.P.Seq | F | M0002236SA:3 1 1 | CH03MAH
447 | 12190 | RTA00002899F.b.10.1.P.Seq | F | M00004430B:3!0 | CH01COH
448 | 10546 | RTA00002901F.o.08.1.P.Seq | F | M00005689C:B02 | CH01COH
449 | 21041 | RTA00002898F.k.08.1.P.Seq | F | M00004372A:E12 | CH01COH
450 | 16484 | RTA00002894F.c.04.1.P.Seq | F | M00003979B:A04 | CH01COH
SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY
451 | 7741 | RTA00002894F.L08.1.P.Seq | F | M00004028B:F10 | CH01COH
452 | 14921 | RTA00002926F.c.15.2.P.Seq | F | M00040081C:E02 | CH09LNL
453 | 17571 | RTA00002900F.m.16.1.P.Seq | F | M00005375D:A10 | CH02COH
454 | 46881 | RTA00002901F.i.20.1.P.Seq | F | M00005622A:H02 | CH02COH
455 | 21533 | RTA00002898F.j.10.1.P.Seq | F | M00004365C:C09 | CH01COH
456 | 19010 | RTA00002916F.k.08.1.P.Seq | F | M00032614D:D08 | CH08LNH
457 | 48768 | RTA00002886F.n.01.1.P.Seq | F | M00001374C:B10 | CH01COH
458 | 7515 | RTA00002892F.p.22.2.P.Seq | F | M00003855C:F02 | CH01COH
459 | 17326 | RTA00002898F.h.02.1.P.Seq | F | M00004350A:A04 | CH01COH
460 | 3902 | RTA00002901F.d.17.1.P.Seq | F | M00005460D:C11 | CH02COH
461 | 12400 | RTA00002901F.d.18.1.P.Seq | F | M00005461A:D12 | CH02COH
462 | 186543 | RTA00002912F.a.06.1.P.Seq | F | M00027193C:A07 | CH04MAL
463 | 4063 | RTA00002916F.j.24.1.P.Seq | F | M00032613A:E11 | CH08LNH
464 | 6267 | RTA00002910F.d.20.1.P.Seq | F | M00022835C:A09 | CH03MAH
465 | 21349 | RTA00002901F.c.04.1.P.Seq | F | M00005445D:FU | CH02COH
466 | 1123 | RTA00002894F.i.24.1.P.Seq | F | M00004031C:G06 | CH01COH
467 | 4401 | RTA00002918F.m.18.1.P.Seq | F | M00032979D:C11 | CH08LNH
468 | 15255 | RTA00002925F.p.10.1.P.Seq | F | M00040041A:G08 | CH09LNL
469 | 10991 | RTA00002933F.a.15.1.P.Seq | F | M00043077C:G10 | CH19COP
470 | 48768 | RTA00002886F.m.24.1.P.Seq | F | M00001374C:B10 | CH01COH
471 | 20406 | RTA00002900F.c.20.1.P.Seq | F | M00004852D:C06 | CH02COH
472 | 39784 | RTA00002886F.S.05.1.P.Seq | F | M00001352B:B02 | CH01COH
473 | 36567 | RTA00002886F.n.06.1.P.Seq | F | M00001375B:D04 | CH01COH
474 | 14317 | RTA00002902F.a.13.1.P.Seq | F | M00005771D:C02 | CH02COH
475 | 156277 | RTA00002907F.l.13.2.P.Seq | F | M00022237D:D06 | CH03MAH
476 | 6898 | RTA00002907F.a.22.1.P.Seq | F | M00022104A:G08 | CH03MAH
477 | 17376 | RTA00002902F.c.03.1.P.Seq | F | M00005819D:F09 | CH02COH
478 | 186535 | RTA00002912F.d.12.1.P.Seq | F | M00027270A:D04 | CH04MAL
479 | 91616 | RTA00002910F.b.24.1.P.Seq | F | M00022812A:G01 | CH03MAH
480 | 91616 | RTA00002910F.c.01.1.P.Seq | F | M00022812A:G01 | CH03MAH
481 | 6993 | RTA00002896F.j.12.1.P.Seq | F | M00004172D:F04 | CH01COH
482 | 12443 | RTA00002916F.a.20.1.P.Seq | F | M00032534B:E12 | CH08LNH
483 | 28585 | RTA00002901F.j.16.1.P.Seq | F | M00005570.V.D05 | CH02COH
484 | 9453 | RTA00002907F.k.21.2.P.Seq | F | M00022228B:B11 | CH03MAH
485 | 156009 | RTA00002907F.k.05.2.P.Seq | F | M00022220A:A07 | CH03MAH
486 | 5958 | RTA00002908F.n.22.2.P.Seq | F | M00022507C:C08 | CH03MAH
487 | 155939 | RTA00002907F.j.23.2.P.Seq | F | M00022218B:B12 | CH03MAH
488 | 16695 | RTA00002886F.g.22.1.P.Seq | F | M00001353D:E05 | CH01COH
489 | 10118 | RTA00002886F.h.13.1.P.Seq | F | M00001356D:E06 | CH01COH
490 | 13288 | RTA00002930F.b.21.1.P.Seq | F | M00042891C:G08 | CH15CON
491 | 3210 | RTA00002910F.h.22.1.P.Seq | F | M00022945A:H09 | CH03MAH
492 | 15014 | RTA00002934F.a.18.1.P.Seq | F | M00043528A:E11 | CH20COHLV
493 | 22087 | RTA00002890F.l.19.1.P.Seq | F | M00001624A:C01 | CH01COH
494 | 31948 | RTA00002903F.i.12.1.P.Seq | F | M00022454C:B08 | CH03MAH
495 | 11593 | RTA00002906F.p.21.1.P.Seq | F | M00022094B:G02 | CH03MAH
496 | 3131 | RTA00002908F.m.17.1.P.Seq | F | M00022494B:D06 | CH03MAH
497 | 151263 | RTA00002906F.j.21.1.P.Seq | F | M00021 1 )9ID:F09 | CH03MAH
498 | 177542 | RTA00002910F.h.23.1.P.Seq | F | M00022945B:F11 | CH03MAH
499 | 9738 | RTA00002924F.f.23.1.P.Seq | F | M00039559B:C07 | CH09LNL
500 | 15313 | RTA00002925F.f.05.1.P.Seq | F | M00039So5A:C09 | CH09LNL



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SEQ 
ED 


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBR.\RY 


501 


19724 


RTA00002906F.p.05. 1 .P.Seq 


F 


M0002208SD:E 10 


CH03MAH 


502 


10731 


RTA00002893F.m.11.1.P.Seq


F 


M00003938C:A05 


CH01COH 


503 


10257 


RTA00002901F.h.09.1.P.Seq


F 


M00005512B:H01 


CH02COH 


504 


186468 


RTA00002913F.b.18.1.P.Seq


F 


M00027746A:D06 


CH04MAL 


505 


14736 


RTA00002908F.g.22.1.P.Seq


F


M00022436C:F11


CH03MAH 


506 


33267 


RTA00002889F.h.14.1.P.Seq


F 


M00001548B:D06 


CH01COH 


507 


7719 


RTA00002908F.e.11.1.P.Seq


F 


M00022403OE12 


CH03MAH 


508 


185539 


RTA00002913F.b.03.1.P.Seq


F


M00027717C:C06


CH04MAL 


509 


14825 


RTA00002924F.f.19.1.P.Seq


F


M00039556C:G05


CH09LNL


510 


3917 


RTA00002906F.p.15.1.P.Seq


F 


M00022092D:A11 


CH03MAH 


511 


18718 


RTA00002895F.h.05.1.P.Seq


F


M00004085B:H02


CH01COH 


512 


186762 


RTA00002918F.b.11.1.P.Seq


F


M00032831A:C07


CH08LNH


513 


2732 


RTA00002925F.i.07.1.P.Seq


F


M00039900B:G04


CH09LNL 


514 


7684 


RTA00002924F.j.17.1.P.Seq


F 


M00039668OF01 


CH09LNL 


515


6852 


RTA00002922F.c.22.1.P.Seq


F 


M00039001A:B10 


CH09LNL 


516 


1422 


RTA00002924F.c.09.1.P.Seq


F 


M00039433C:E03 


CH09LNL 


517 


5560 


RTA00002917F.i.01.1.P.Seq


F


M00032734C:C05


CH08LNH


518 


48734 


RTA00002908F.l.23.1.P.Seq


F 


M00022487C:C02 


CH03MAH 


519 


10486 


RTA00002899F.g.07.1.P.Seq


F


M00004509B:B10


CH01COH


520 


33514 


RTA00002890F.j.03.1.P.Seq


F


M00001626A:D07


CH01COH


521 


5821 


RTA00002917F.m.01.1.P.Seq


F


M00032772D:D03


CH08LNH


522 


5821 


RTA00002917F.l.24.1.P.Seq


F


M00032772D:D03


CH08LNH


523 


21940 


RTA00002896F.a.03.1.P.Seq


F


M00004134A:A08


CH01COH


524 


185724 


RTA00002912F.m.08.1.P.Seq


F 


M00027523A:H05 


CH04MAL 


525 


182887 


RTA00002910F.k.21.1.P.Seq


F 


M00022992A:H06 


CH03MAH 


526 


21346 


RTA00002901F.g.24.1.P.Seq


F 


M00005507B:A03 


CH02COH 


527 


5501 


RTA00002887F.n.12.1.P.Seq


F


M00001424B:H06


CH01COH


528 


13961 


RTA00002892F.j.14.1.P.Seq


F


M00003828A:D11


CH01COH


529 


16784


RTA00002886F.a.09.1.P.Seq


F


M00001338D:D01


CH01COH


530 


17628


RTA00002916F.f.10.1.P.Seq


F 


M00032568B:F08 


CH08LNH


531 


3304 


RTA00002898F.d.05.1.P.Seq


F


M00004324A:B03


CH01COH


532 


14895 


RTA00002901F.g.14.1.P.Seq


F 


M00005504C:F12 


CH02COH 


533 


16036 


RTA00002891F.k.09.1.P.Seq


F


M00003763B:B10


CH01COH


534 


23877 


RTA00002891F.k.15.1.P.Seq


F


M00003764B:F11


CH01COH


535 


186784 


RTA00002930F.i.17.1.P.Seq


F


M00056105A:D06


CH15CON


536 


13591 


RTA00002901F.f.15.1.P.Seq


F 


M00005485C:H04 


CH02COH 


537 


17916 


RTA00002906F.p.08.1.P.Seq


F 


M00022090B:A10 


CH03MAH 


538 


40594 


RTA00002897F.i.15.1.P.Seq


F


M00004266B:F07


CH01COH


539 


9677 


RTA00002925F.i.21.1.P.Seq


F 


M00039915B:E08 


CH09LNL 


540 


7736 


RTA00002887F.e.03.1.P.Seq


F


M00001393B:C03


CH01COH


541 


2474 


RTA00002917F.e.15.1.P.Seq


F


M00032707D:F08


CH08LNH


542 


23810


RTA00002892F.i.06.1.P.Seq


F


M00003822D:A02


CH01COH


543 


24633 


RTA00002907F.i.19.1.P.Seq


F 


M00022208B:D03 


CH03MAH 


544 


72081 


RTA00002925F.k.03.1.P.Seq


F


M00039929D:H10


CH09LNL 


545 


5991 


RTA00002916F.i.17.1.P.Seq


F


M00032597A:H02


CH08LNH


546 


14596 


RTA00002911F.n.15.1.P.Seq


F


M00027131A:B03


CH04MAL


547 


6923 


RTA00002896F.d.01.1.P.Seq


F


M00004146B:E08


CH01COH


548 


6923 


RTA00002896F.c.24.1.P.Seq


F


M00004146B:E08


CH01COH


549 


21851 


RTA00002887F.d.09.1.P.Seq


F


M00001391D:D03


CH01COH


550 


3935 


RTA00002925F.j.08.1.P.Seq


F


M00039921C:H11


CH09LNL 



WO 01/02568 PCT/US00/18374 





551 


13328 


RTA00002909F.h.08.1.P.Seq


F 


M00022634B:H09 


CH03MAH 


552 


2492 


RTA00002887F.e.11.1.P.Seq


F 


M00001393D:E02 


CH01COH 


553 


11960 


RTA00002917F.b.03.1.P.Seq


F


M00032671B:D08


CH08LNH


554 


186084 


RTA00002912F.f.18.1.P.Seq


F 


M00027319D:F07 


CH04MAL 


555 


13644 


RTA00002925F.a.09.1.P.Seq


F


M00039805B:B06


CH09LNL 


556 


5707 


RTA00002909F.k.13.1.P.Seq


F 


M00022672C:H04 


CH03MAH 


557 


95700 


RTA00002911F.p.14.1.P.Seq


F


M00027182B:G06


CH04MAL 


558 


342 


RTA00002922F.i.23.1.P.Seq


F


M00039076D:G04


CH09LNL 


559 


8481 


RTA00002887F.c.12.1.P.Seq


F 


M00001389D:D06 


CH01COH 


560 


12575 


RTA00002916F.i.12.1.P.Seq


F 


M00032594C:F05 


CH08LNH 


561 


40712 


RTA00002921F.d.08.1.P.Seq


F


M00033359C:H05


CH09LNL 


562 


10768 


RTA00002886F.d.24.1.P.Seq


F


M00001346B:G11


CH01COH


563 


38781 


RTA00002889F.k.23.1.P.Seq


F 


M00001559A:H09 


CH01COH 


564 


8790 


RTA00002888F.g.08.1.P.Seq


F 


M00001461D:C10 


CH01COH


565 


10167 


RTA00002916F.k.22.1.P.Seq


F


M00032621A:F11


CH08LNH 


566 


13706 


RTA00002905F.e.21.1.P.Seq


F


M00008019B:A01


CH03MAH 


567 


124172 


RTA00002900F.a.09.1.P.Seq


F


M00004824A:D12


CH02COH 


568 


92126 


RTA00002910F.g.12.1.P.Seq


F


M00022904C:D04


CH03MAH 


569 


5830 


RTA00002916F.j.09.1.P.Seq


F


M00032605B:D09


CH08LNH


570 


15154 


RTA00002886F.p.13.1.P.Seq


F


M00001382D:A07


CH01COH


571 


25813 


RTA00002910F.i.12.1.P.Seq


F 


M00022952A:B02 


CH03MAH 


572 


17268 


RTA00002886F.d.07.1.P.Seq


F


M00001344D:E08


CH01COH


573 


13684 


RTA00002915F.j.09.1.P.Seq


F


M00031485B:G05


CH08LNH


574 


13460 


RTA00002898F.f.19.1.P.Seq


F


M00004341C:A09


CH01COH


575 


25115 


RTA00002919F.p.18.1.P.Seq


F 


M00033311B:G10 


CH08LNH 


576 


19949 


RTA00002905F.e.17.1.P.Seq


F 


M00008016B:E09 


CH03MAH 


577 


24266 


RTA00002917F.k.06.1.P.Seq


F


M00032759A:A03


CH08LNH


578 


8243 


RTA00002901F.o.17.1.P.Seq


F


M00005703B:E03


CH02COH 


579 


12576


RTA00002900F.k.23.1.P.Seq


F 


M00005359B:S0S 


CH02COH 


580 


28531 


RTA00002909F.c.04.1.P.Seq


F 


M00022559D:G10 


CH03MAH 


581 


15153 


RTA00002894F.o.21.1.P.Seq


F


M00004054A:D03


CH01COH


582 


9498 


RTA00002894F.e.04.1.P.Seq


F


M00003985D:B02


CH01COH


583 


48140 


RTA00002914F.h.13.1.P.Seq


F


M00028211A:F10


CH08LNH


584 


7626 


RTA00002895F.b.04.1.P.Seq


F


M00004061B:E05


CH01COH


585 


22668 


RTA00002896F.p.17.1.P.Seq


F


M00004204C:H08


CH01COH


586 


45691 


RTA00002908F.a.11.1.P.Seq


F 


M00022305A:B04 


CH03MAH 


587 


30429 


RTA00002904F.a.19.1.P.Seq


F 


M00007155D:C09 


CH02COH 


588 


46969 


RTA00002909F.g.02.1.P.Seq


F 


M00022618C:E04 


CH03MAH 


589 


44030 


RTA00002900F.o.23.1.P.Seq


F 


M00005405C:D01 


CH02COH 


590 


142548 


RTA00002905F.h.10.1.P.Seq


F


M00008073A:D01


CH03MAH


591


18455 


RTA00002905F.g.18.1.P.Seq


F


M00008059B:F08


CH03MAH 


592 


7501 


RTA00002894F.g.05.1.P.Seq


F 


M00003993D:B03 


CH01COH


593 


7280 


RTA00002893F.n.22.1.P.Seq


F


M00003959D:A05


CH01COH


594 


19339 


RTA00002898F.l.12.1.P.Seq


F


M00004376D:A12


CH01COH


595 


30194 


RTA00002922F.k.05.1.P.Seq


F


M00039100A:G04


CH09LNL 


596 


32650 


RTA00002911F.i.05.1.P.Seq


F 


M00026994D:D07 


CH04MAL 


597 


10510 


RTA00002905F.d.17.1.P.Seq


F


M00008001B:F05


CH03MAH 


598 


13539 


RTA00002898F.f.03.1.P.Seq


F


M00004336A:A01


CH01COH


599 


20149 


RTA00002917F.o.03.1.P.Seq


F


M00032791D:F01


CH08LNH


600 


12780 


RTA00002S9 1 F.e .00. 1 .P.Seq 


F 


M0000lbS6D:F06 


CH01COH





601 


182479 


RTA00002910F.j.18.1.P.Seq


F 


M00022972C:E05 


CH03MAH 


602 


14016 


RTA00002923F.n.17.1.P.Seq


F 


M000393-UCAU 


CH09LNL 


603 


76075 


RTA00002890F.h.23.1.P.Seq


F 


M00001620B:A03 


CH01COH 


604 


9806 


RTA00002922F.j.05.1.P.Seq


F


M00039078D:C10


CH09LNL


605 


9086 


RTA00002889F.k.15.1.P.Seq


F 


M00001553A:E06 


CH01COH 


606 


2619 


RTA00002907F.o.12.1.P.Seq


F


M00022269C:A04


CH03MAH


607 


17517 


RTA00002907F.h.06.1.P.Seq


F


M00022185A:B03


CH03MAH 


608 


5089 


RTA00002915F.e.22.2.P.Seq


F 


M00028777B:G04 


CH08LNH 


609 


6728 


RTA00002904F.b.13.1.P.Seq


F 


M00007178A:C02 


CH02COH 


610 


41149 


RTA00002899F.g.20.1.P.Seq


F 


M00004603B:E02 


CH01COH 


611 


35017 


RTA00002892F.f.03.2.P.Seq


F


M00003812C:A03


CH01COH


612 


7008 


RTA00002923F.a.08.1.P.Seq


F


M00039165D:C04


CH09LNL


613 


185545


RTA00002912F.k.16.1.P.Seq


F 


M0002748GCE09 


CH04MAL


614 


17840 


RTA00002892F.p.15.2.P.Seq


F


M00003854B:F07


CH01COH


615 


185914 


RTA00002912F.j.24.1.P.Seq


F 


M00027467A:C07 


CH04MAL 


616 


6862 


RTA00002903F.b.08.1.P.Seq


F 


M00006872D:B07 


CH02COH 


617 


20120 


RTA00002888F.c.24.1.P.Seq


F


M00001445B:F06


CH01COH


618 


20120 


RTA00002888F.d.01.1.P.Seq


F


M00001445B:F06


CH01COH


619 


13879 


RTA00002923F.d.02.1.P.Seq


F 


M00039207A:F07 


CH09LNL 


620 


9330 


RTA00002915F.g.16.1.P.Seq


F


M00028786B:A04


CH08LNH


621 


21572 


RTA00002921F.h.19.1.P.Seq


F 


M00033441A:B12 


CH09LNL 


622 


2943 


RTA00002919F.h.22.1.P.Seq


F 


M00033144A:D02 


CH08LNH 


623 


32154 


RTA00002905F.b.16.1.P.Seq


F


M00007969D:C01


CH03MAH


624 


20875 


RTA00002901F.k.16.1.P.Seq


F 


M00005603B:H03 


CH02COH 


625 


186324 


RTA00002912F.d.17.1.P.Seq


F


M00027274A:A09


CH04MAL


626 


10768 


RTA00002886F.e.01.1.P.Seq


F 


M00001346B:G11 


CH01COH 


627 


16711 


RTA00002935F.m.11.1.P.Seq


F


M00055221C:H11


CH17COHLV


628 


14688


RTA00002925F.n.14.1.P.Seq


F 


M00040023B:B10 


CH09LNL 


629 


44419 


RTA00002907F.b.19.1.P.Seq


F


M00022118A:E06


CH03MAH


630 


12614 


RTA00002896F.p.03.1.P.Seq


F


M00004201D:C01


CH01COH


631 


21658 


RTA00002902F.c.23.1.P.Seq


F


M00006576D:C02


CH02COH 


632 


10150 


RTA00002901F.i.16.1.P.Seq


F 


M00005540A:F09 


CH02COH 


633 


185909 


RTA00002912F.c.20.1.P.Seq


F


M00027262A:A07


CH04MAL 


634 


14893


RTA00002890F.f.08.1.P.Seq


F


M00001607D:H09


CH01COH


635 


32125 


RTA00002903F.c.08.1.P.Seq


F


M00006884D:A08


CH02COH 


636 


11909 


RTA00002902F.a.11.1.P.Seq


F 


M00005766D:D12 


CH02COH 


637 


17237 


RTA00002901F.l.12.1.P.Seq


F 


M00005616B:F07 


CH02COH 


638 


11148 


RTA00002900F.j.18.1.P.Seq


F 


M00005346D:A03 


CH02COH 


639 


14837 


RTA00002925F.n.20.1.P.Seq


F 


M00040025A:B04 


CH09LNL 


640 


4343 


RTA00002897F.l.13.1.P.Seq


F


M00004282B:D07


CH01COH


641 


18686


RTA00002898F.j.16.1.P.Seq


F


M00004366D:C11


CH01COH


642 


10090 


RTA00002892F.n.10.2.P.Seq


F


M00003842D:H09


CH01COH


643 


612 


RTA00002889F.d.13.2.P.Seq


F


M00001535B:E02


CH01COH


644 


10752 


RTA00002892F.n.06.2.P.Seq


F


M00003842D:D11


CH01COH


645 


167203 


RTA00002914F.c.14.1.P.Seq


F


M00028070A:H09


CH08LNH


646 


21269 


RTA00002901F.j.15.1.P.Seq


F


M00005570A:B08


CH02COH 


647 


186250 


RTA00002910F.a.21.1.P.Seq


F


M00022797D:A06


CH03MAH 


648


24633 


RTA00002907F.i.19.2.P.Seq


F


M00022208B:D03


CH03MAH 


649 


12295 


RTA00002918F.c.02.1.P.Seq


F


M00032836B:A07


CH08LNH


650 


7870 


RTA00002905F.b.22.1.P.Seq


F


M00007973B:D11


CH03MAH


OJ 1 


1 LSI J 


Ix 1 r.Q.Uo. l.r.oeQ 


r 


\ .f AAAA A -s 0 ^ \ .P\A"7 

IvLUUUUoD o J A: UU i 


CH02COH 


AS7 


777*\ 
1 I ID 


Ix 1 AUUVJU— 07_r.O. IZ.Z.r.oeq 


r 


V,fAAAA7Q t ~ \ . II A t 

ivlUUUOjo** ; A:HU4 


CH01COH 


At"? 




k. i A(XKjvj-^y_vr.r — 1 . i.r.seq 


t* 
r 


V/fAAA,1A1 inn.nm 

MUU04U J49 U. UK) 1 


CH14EDT 


OJ*+ 


AQ1 1 
OO J 1 


nj \ AAAA">Q77P K 7 1 1 D 

K 1 AUUUU-yz /r.D.z 1 . I .r.Seq 


r 


MUU0j948jA:L) 10 


CH12EDT 


OJJ 


lU / JO 


K 1 AOUUUzyjUr.o.Uo. 1 .r.Seq 


r 


KvfAAA iTTT f \ f^f\£. 

MU004 1 1 24 A : CjOo 


CHIjCON 


A** A 
OJO 


1 7GQA 

I /9oo 


DT \AAAA7niop ~ 7A 1 D C^n 

K 1 AUUUU.Vjzr.a.zU. I. r.Seq 


r 


XifAAA 1 1AT ~s f~* . CA ♦ 

MU0U429 / 2C :r*04 


CH18CON 


A>7 
O J / 


01 1 A1 
Zj lOJ 


DT A AAAfT*QG^ P K Al 1 D C_>„ 

k i AUUUUj.oVjr.n.uj. i.r.oeq 


r 


\ A A A A A /I AOC \ . IT A t 

MU000408 j A . HO I 


CH01COH 


A^a 
OJO 


40JO 


t?t" a aaaatqtjp i i ^ i d 
K l .-\uuuu_yz jr. 1. 1 j. l .r.ocq 


r 


IV /f AAA "5 AO O 1 T~a . [JA"1 

MU00j9zo41J:rl0/ 


CH09LNL 


A*^Q 
OJ" 


ZjJOU 


dt a AAnmon^p ^ n< t ocan 
K i .Auuuuzyuj r .e.uj . i .r.oeq 


r 


TlyTAAAAO AAT D -PA7 

MUUUUoUU/ t3:tUj 


LH03MAH 


AAA 
OOU 
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CH01COH 


794 


3272 


RTA00002897F.a.18.1.P.Seq


F 


M00004212D:C03


CH01COH 


795 


14446 


RTA00002899F.d.05.1.P.Seq


F 


M00004462D:D12 


CH01COH 


796 


17865 


RTA00002918F.a.13.1.P.Seq


F


M00032825B:F03


CH08LNH 


797 


5834 


RTA00002898F.h.12.1.P.Seq


F


M00004352A:D08


CH01COH 


798 


14533 


RTA00002896F.k.24.1.P.Seq


F 


M0000417^C:B06 


CH01COH 


799 


15222


RTA00002900F.j.05.1.P.Seq


F


M00005332A:C06


CH02COH 


800 


22594 


RTA00002898F.h.21.1.P.Seq


F 


M0000435"3:B06 


CH01COH 


Qf\ 1 
oU I 


y^U4 


DTAnnnmxQDPh on i p c*»n 

K 1 nUWUU-07Ur.fl._U. l.r.oeq 


C 

r 




Lnu i v_ w ii 




lo040 ,£ * 


ota nnrmoo 1 1 f h dq o p c»n 

K 1 1 I r u.U7 — .r . oeq 


r 








t 1 


dt a nfifwioofyoF 'ill IP 


r 


M000048' , 4D • H05 


CH0TOH 








F 
r 


M0O0O3935B B01 


CH01COH 




15 JJ 1 


DT AnnOfPQ 1 SP i IS 1 P 

t\ 1 AUwUi" I jr.J. I J - 1 .r .jcq 


F 


M0003''47 ID- A05 


CH0SLNH 


oUO 


1 1 1 OG 


dt Afinnno^cwP i n i p c,*n 


F 


M000043 1 OR FO^ 


CH01COH 


om 


looj /0 


DTA AnOTiOQ 1 OP If 0 I 1 P C^n 


c 
r 


M00fP748 F07 




ono 
oUo 


1 "79 ! A 

1 /o lo 


DT A (VM^CiOQO 1 P a ClA 1 P C^n 
K i .AUUlA/i7Uir.O.V>4. l»r.ocq 


r 


M00005674C F04 


CHO^COH 




o4J4 


dt a nnnno ao-ip 1 00 \ P C.»n 


F 




CH09LNL 


olU 


i_ 140 




p 
r 


M 000^0067 R F07 


CH09LNL 


oil 


7 1 0 1 0 




F 


Mf)0007 1 S4 A ■ F06 


CHO^COH 


Q 1 O 


1 1 Q"7 

14o / 




p 
r 


Mnnrkinn i f>c ■ Fn7 


CH09LNL 

IV 7 L. 1 i La 


OlJ 


0/1777 
Z 4 / / / . 


DTAnnnmonnp n no i p 


F 
r 


Mnnnns^soR hio 

iviwuuJJOvD • n i \j 


CHO^COH 


Q 1 /I 
514 


1 AAAQ1 


d t a nfifiooorn f h ni i p 


F 
r 


MOOOOnS?^ A H 10 


CfKPCOH 


O 1 5 


0540 




p 
r 




CH17COHLV 


O 1 rf£ 

olo 


DVo4 




c 
r 


Monns^don a -FOn 


CH17COH7 V 

V_ 1 1 1 / V_ v_/ 1 l_L~i V 


ol / 


1 /* A A t 

Z4441 




p 
r 


MnnnfLi9i o n ■ nru 


fHOTOH 




20889 


K 1 AUUUlLiV J5r.fl.U9. 1.r.oeq 


c 
r 




pHI7Pnt-T7 V 
tni i Luru. v 


O ■ A 

819 


127721 


Rl A00UU29l5r.C.lo. l.r.oeq 


IT 

r 


\/fnnnoQ7A7 a r**. i i 

MUUU-C) / D J* A . vj I 1 


Lnuo Lin n 


820 


20684 


d t \ AAnmofYic ^» m 1 p Cjfl 
K 1 AUUUUZyuUr.C.Lu. l.r.oeq 


c 
r 






821 


30095 


K 1 AUUUUiyU/ r.i.^U.z.r .oeq 


c 
r 




fUA'lM AH 


822 


6763 


K 1 AUUUU-o9_r.o.U1.2.r.oeq 


r 






823 


6763 


K 1 AUUUU-io9-r.n.24.2.r.oeq 


c 
r 


\yfonnri79j: ^ n -nn^ 




824 


48725 


DT,\AAf\A1Qn"C t OO ~) D 

K 1 AUUUU-9U / r.i-ii.-i.r.oeq 


p 
r 


Yinnno oo _in r c i ^ 


furiiM AH 


OZ5 


21260 


K 1 auuuu_v j_*r.c.--. i.r.ocq 


p 
r 


Monn^ j_ioo a cor 


CHI 7 COHLV 


OO/C 

820 


420/2 


dta rwv^noo. ^n"c r. t i id 

K 1 AUUuUiVjUrX.zl.l.K.oeq 


p 
r 




ru | SPON 
v_. n i j V— v_/ 1 ^ 


82/ 


i /i /* i 
3441 


K 1 AUUUU-iV J jr. 1. 1 J. I ,-r.ocq 


p 
r 




CHI7COHLV 


QIC 


1 1 1 lA 

214 19 


dt vn^nmQ'jnc k i * 1 p 


p 
r 




CHI5CON 


82y 


o aa ,i 


dt a nnn/Yio, 1 nc k nQ 1 p 

K 1 AUUUU-V lUr.O.Uo. i.r ,oeq 


p 
r 


Mnnoooon^R • A 1 0 


CHO^M AH 


830 


t o coin 

185870 


K I AUUUUi9 L.r.C.UO. l.r.5eq 


p 
r 


v^nnn"* 70 j"/ - • nno 


CH04M AL 


831 


24580 


K 1 AUUUUiyjUr.a.Ul . l.r.5eq 


p 
r 


ivi j j o o -*a . r u v> 




832 


5 153 


K 1 AUUw^y jUr.D. 10. i.r.ocq 


p 
r 


N/innnd0 7j,"in-n i n 


CH15CON 


83j 


8653 


K l AUUUUioy?r.r. i /. l.r.oeq 


p 
r 


iV/i nnnHiin ^ n p • c ru. 




834 


2j/99 


dt \nnnniQ0 ip 1 11 1 p 
k 1 auuuu— y _-+r j. 1 .r . ocq 


p 
r 


iVIL/UU J"U yOv_ . DUJ 


PH09LNL 


835 


1 1012 


dt \ AAnmoinc ■ aq t d Can 
K 1 AUUUUiy^Ur.J.uy. 1 .r.oeq 


P 
r 


Mfinn^Ao i sn*Fno 

IVIwUJO- I J U . i w_ 


phi sroM 


OjO 


/(/CAT 

40592 


DTAnAAnoonnp k iq 1 p c -.^ 
k 1 AUuuu_yuur.o. iy. i .r .oeq 


p 
r 


A/innonds^oRp i o 


CHOTOH 


83/ 


0050 


K 1 AUUUU-yuor.rn, 11. 1 .r.oeq 


p 
r 


\/fnnno^ jo i n- a i o 


run^M AH 


o3o 


1 t O 

loo lo 


dt Annnno^QOP « is 1 p 
k i Auuuuioovr .n. 10. 1 .r.ocq 


F 
r 


vinnnoi ^A^r- ao i 


CHOICOH 


OTA 

839 


ton i 
182/4 


K 1 AUUUU-LOoyr.g.Uj. l.r.ocq 


p 
r 






O /I A 

840 


20694 


K 1 AUUUUiyUor.n.Uo. l.r.^eq 


p 
r 


IVlwUU— _~r-+_ D .UUJ 


PH0"^M AH 


84 1 


9493 


K i Auuuuiyuyr.m. 1 1 . i.r.oeq 


F 

r 




PHO^iVI AH 


84^ 


fin'? 

V 1 Ji 


RTA00002897F.c.04.1.P.Seq


F


M00004220D:C11


CH01COH


843 


186259 


RTA00002912F.m.13.1.P.Seq


F


M00027527B:C05


CH04MAL


844 


3769 


RTA00002916F.g.22.1.P.Seq


F


M00032581B:A09


CH08LNH


845 


36584


RTA00002935F.f.12.1.P.Seq


F 


M000546S?D:Gll 


CH17COHLV


846 


38077 


RTA00002890F.e.06.1.P.Seq


F


M00001605B:B05


CH01COH


847 


3927 


RTA00002935F.a.12.1.P.Seq


F


M00042516B:D01


CH17COHLV


848 


4275 


RTA00002914F.b.16.1.P.Seq


F 


M0002SOc?C:HOl 


CH08LNH


849 


12554 


RTA00002921F.a.23.1.P.Seq


F


M00033302A:E11


CH09LNL 


850 


13761 


RTA00002901F.f.22.1.P.Seq


F 


M000054895:COS 


CH02COH





851 


19059 


RTA00002897F.e.22.1.P.Seq


F


M00004237C:D10


CH01COH


852 


22944 


RTA00002935F.b.17.1.P.Seq


F 


M00043355A:D07 


CH17COHLV 


853 


2189 


RTA00002925F.j.06.1.P.Seq


F 


M00039921A:B10 


CH09LNL 


854 


19153 


RTA00002892F.h.04.2.P.Seq


F 


M00003819B:B01 


CH01COH 


855 


1833 


RTA00002890F.e.13.1.P.Seq


F


M00001606B:A10


CH01COH


856 


18447 


RTA00002935F.d.23.1.P.Seq


F


M00054569A:B07


CH17COHLV


857 


2461 


RTA00002922F.b.08.1.P.Seq


F


M00038619B:F09


CH09LNL 


858 


15917 


RTA00002896F.j.06.1.P.Seq


F


M00004172C:A08


CH01COH


859 


9379 


RTA00002935F.a.15.1.P.Seq


F


M00043299A:B10


CH17COHLV


860 


5511 


RTA00002931F.b.06.1.P.Seq


F


M00042796A:A10


CH16COP


861 


10540 


RTA00002891F.k.16.1.P.Seq


F


M00003764B:H11


CH01COH


862 


12117 


RTA00002899F.a.09.1.P.Seq


F


M00004419A:G02


CH01COH


863 


8777


RTA00002919F.a.23.1.P.Seq


F 


M00033028D:C10 


CH08LNH 


864 


23972 


RTA00002900F.o.18.1.P.Seq


F 


M00005403C:A01 


CH02COH 


865 


17005 


RTA00002896F.m.10.1.P.Seq


F


M00004187B:C02


CH01COH


866 


1085 


RTA00002924F.l.20.1.P.Seq


F 


M00039694C:H01 


CH09LNL 


867 


4270 


RTA00002922F.a.24.1.P.Seq


F 


M00038616C:C09 


CH09LNL 


868 


4609 


RTA00002935F.e.15.1.P.Seq


F


M00054599D:B03


CH17COHLV


869 


6889 


RTA00002919F.c.07.1.P.Seq


F


M00033037B:F04


CH08LNH


870 


15228 


RTA00002919F.e.06.1.P.Seq


F


M00033055D:D02


CH08LNH


871 


20971 


RTA00002904F.a.22.1.P.Seq


F 


M00007158D:D03 


CH02COH 


872 


5174 


RTA00002935F.a.23.1.P.Seq


F


M00043313D:E09


CH17COHLV


873 


15236 


RTA00002928F.e.16.1.P.Seq


F


M00040198A:F12


CH13EDT 


874 


9223 


RTA00002896F.b.15.1.P.Seq


F


M00004141A:D01


CH01COH


875 


24591 


RTA00002923F.g.10.1.P.Seq


F 


M00039251C:H12 


CH09LNL


876 


36306 


RTA00002888F.l.11.1.P.Seq


F


M00001485C:F06


CH01COH


877 


3309 


RTA00002893F.j.21.1.P.Seq


F


M00003916A:E04


CH01COH


878 


186712 


RTA00002911F.c.11.2.P.Seq


F


M00026809A:H08


CH04MAL 


879 


9090 


RTA00002891F.g.23.1.P.Seq


F 


M00003746C:E11 


CH01COH


880 


11510 


RTA00002888F.i.07.1.P.Seq


F


M00001467C:D04


CH01COH


881 


9784 


RTA00002889F.j.15.1.P.Seq


F


M00001554C:G10


CH01COH


882 


25618 


RTA00002930F.a.11.1.P.Seq


F


M00042554A:D01


CH15CON


883 


12493 


RTA00002928F.i.11.1.P.Seq


F


M00040289D:C06


CH13EDT


884 


24361 


RTA00002933F.b.08.1.P.Seq


F


M00043134A:F05


CH19COP


885 


12449 


RTA00002930F.d.02.1.P.Seq


F


M00055468A:A08


CH15CON


886 


17894 


RTA00002929F.a.04.1.P.Seq


F 


M00039747B:B06 


CH14EDT 


887 


13204 


RTA00002930F.f.09.1.P.Seq


F


M00055745B:A08


CH15CON


888 


32119 


RTA00002930F.c.19.1.P.Seq


F


M00055448B:E05


CH15CON


889 


5909 


RTA00002935F.i.23.1.P.Seq


F


M00054931D:E10


CH17COHLV


890 


24453 


RTA00002927F.d.15.1.P.Seq


F


M00039526A:A08


CH12EDT 


891 


46982 


RTA00002935F.k.22.1.P.Seq


F


M00055093B:A03


CH17COHLV


892 


43888 


RTA00002932F.b.23.1.P.Seq


F 


M000430^0A:C03 


CH18CON


893 


24580 


RTA00002930F.c.24.1.P.Seq


F 


M000554o6A:F06 


CH15CON


894 


186495 


RTA00002927F.a.21.1.P.Seq


F 


M000393o4D:E05 


CH12EDT


895 


12420 


RTA00002932F.b.21.1.P.Seq


F 


M000430c3C:H05 


CH18CON


896 




RTA00002916F.e.14.1.P.Seq


F


M00032562C:F01


CH08LNH


897 


10438 


RTA00002930F.j.13.1.P.Seq


F


M00056230D:E07


CH15CON


898 


12367 


RTA00002922F.n.10.1.P.Seq


F


M00039133B:D06


CH09LNL 


899 


5012 


RTA00002930F.k.16.1.P.Seq


F 


M0005tv42A:C03 


CH15CON


900 


6458 


RTA00002929F.c.21.1.P.Seq


F 


M00040:^1A:G10 


CH14EDT 





901 


16507 


RTA00002929F.d.08.1.P.Seq


F 


M00040298B:B09 


CH14EDT 


902 


13914 


RTA00002922F.h.18.1.P.Seq


F 


M00039063C:H09


CH09LNL 


903 


11590 


RTA00002930F.k.18.1.P.Seq


F 


M000^64^6C:F01 


CH15CON


904 


15380 


RTA00002928F.i.06.1.P.Seq


F


M00040287A:C11


CH13EDT


905 


10190 


RTA00002895F.k.07.1.P.Seq


F 


M00004096D:F02 


CH01COH 


906 


12593 


RTA00002934F.a.12.1.P.Seq


F 


M0004j48^C:C0^ 


CH20COHLV 


907 


112813 


RTA00002905F.o.14.1.P.Seq


F


M00021677A:D09


CH03MAH


908 


15929 


RTA00002930F.j.18.1.P.Seq


F


M00056244C:H05


CH15CON


909 


16670 


RTA00002935F.o.16.1.P.Seq


F 


M0005o^8>C:C12 


CH17COHLV 


910 


10924 


RTA00002907F.k.12.2.P.Seq


F


M00022224A:C07


CH03MAH


911 


6233 


RTA00002896F.b.17.1.P.Seq


F 


M00004UiB:B01 


CH01COH 


912 


14777 


RTA00002897F.k.09.1.P.Seq 


F 


M00004277D:B02


CH01COH 


913 


12797 


RTA00002935F.h.01.1.P.Seq


F


M00054781D:A11


CH17COHLV 


914 


186041 


RTA00002912F.c.01.1.P.Seq


F


M00027244C:B06


CH04MAL


915 


8182 


RTA00002931F.a.22.1.P.Seq


F 


M00042/66C:D0^ 


CH16COP 


916 


23088 


RTA00002888F.p.20.1.P.Seq


F 


M00001506B:D11 


CH01COH 


917 


24298 


RTA00002935F.p.18.1.P.Seq


F 


M00055473C:F02 


CH17COHLV 


918 


40621 


RTA00002896F.k.12.1.P.Seq


F 


M00004176C:A09 


CH01COH 


919 


7124 


RTA00002935F.b.07.1.P.Seq


F 


M00043328C:E04 


CH17COHLV 


920 


21107 


RTA00002901F.i.02.1.P.Seq


F 


M00005524C:H04 


CH02COH 


921 


10807 


RTA00002928F.l\ 1 5. 1 .P.Seq 


F 


M00040162A:E02 


CH13EDT 


922 


12162 


RTA00002915F.j.23.1.P.Seq


F


M00032475A:A06


CH08LNH


923 


14747 


RTA00002931F.a.18.1.P.Seq


F 


M00042512D:D10 


CH16COP 


924 


6824 


RTA00002931F.b.23.1.P.Seq


F


M00042857C:E01


CH16COP 


925 


39115 


RTA00002932F.a.17.1.P.Seq


F


M00042967D:C01


CH18CON


926 


9484 


RTA00002934F.a.13.1.P.Seq


F


M00043490C:F02


CH20COHLV 


927 


77981 


RTA00002890F.j.21.1.P.Seq


F


M00001633D:C11


CH01COH 


928 


16061 


RTA00002932F.b.24.1.P.Seq


F


M00043113C:G09


CH18CON


929 


4334 


RTA00002930F.d.12.1.P.Seq


F


M00055527B:E01


CH15CON


930 


9427 


RTA00002935F.e.21.1.P.Seq


F 


M00054623OF05 


CH17COHLV 


931 


167736 


RTA00002935F.l.11.1.P.Seq


F


M00055117A:E02


CH17COHLV 


932 


16524 


RTA00002935F.b.19.1.P.Seq


F 


M000433^8C:A02 


CH17COHLV 


933 


23496 


RTA00002932F.a.21.1.P.Seq


F 


M000429"*6D:C01 


CH18CON


934 


163647 


RTA00002901F.m.17.1.P.Seq


F 


M0000^o34A:F07 


CH02COH 


935 


14239 


RTA00002931F.b.07.1.P.Seq


F


M00042801C:D01


CH16COP


936 


25574 


RTA00002886F.g.20.1.P.Seq


F 


M00001353C:A05 


CH01COH 


937 


2737 


RTA00002932F.a.08.1.P.Seq


F


M00042588C:E02


CH18CON


938 


6925 


RTA00002930F.l.01.1.P.Seq


F


M00056458C:E01


CH15CON


939 


21106 


RTA00002925F.p.16.1.P.Seq


F 


M00040045B:H07 


CH09LNL 


940 


28134 


RTA00002917F.k.23.1.P.Seq


F 


M00032765A:C05 


CH08LNH 


941 


186496 


RTA00002917F.l.16.1.P.Seq


F 


M000327"0C:G1 1 


CH08LNH 


942 


21625 


RT A00002 S9 j»F . | . 1 S. 1 .P.Seq 


rr 

r 


M0000_> l J 1 DC :CjOo 


run 1 pr\U 


943 


12537 


RTA00002930F.g.23.1.P.Seq


F 


M00055919B:C10 


CH15CON 


944 


15577 


RTA00002902F.r\ 15.1. P.Seq 


F 


M00006c36B:E04 


CH02COH 


945 


6106 


RTA00002935F.f.11.1.P.Seq


F 


M00054cS2B:H02 


CH17COHLV 


946 


17136 


RTA00002935F.h.14.1.P.Seq


F


M00054818B:F10


CH17COHLV 


947 


2582 


RTA000O292 i Ft. 14. 1. P. Seq 


F 


M000334i3A:A08 


CH09LNL


948


16638 


RTA00002899F.a.15.1.P.Seq


F


M00004420D:E05


CH01COH 


949 


8369


RTA00002929F.a.05.1.P.Seq


F


M00039748C:G09


CH14EDT 


950 


14426 


RTA00002914F.c.12.1.P.Seq


F


M00028069D:H02


CH08LNH


951 


11994 


RTA00002890F.h.14.1.P.Seq


F 


M0000161 ,C:F10 


CH01COH


952 


186664 


RTA00002932F.a.05.1.P.Seq


F 


M00042iSrD:D03 


CH18CON


953 


162235 


RTA00002907F.j.06.2.P.Seq 


F 


M00022212D:G02 


CH03MAH 


954 


2127 


RTA00002912F.o.14.1.P.Seq


F 


M000276053:D09 


CH04MAL 


955 


41014 


RTA00002901F.n.04.1.P.Seq


F 


M00005641B:E09 


CH02COH 


956 


17636 


RTA00002933F.c.19.1.P.Seq


F 


M00043222C:B06 


CH19COP 


957 


2328 


RTA00002935F.e.05.1.P.Seq


F 


M00054579A:C02 


CH17COHLV 


958 


15414 


RTA00002935F.p.13.1.P.Seq


F


M00055423C:H10


CH17COHLV 


959 


11948 


RTA00002895F.o.01.1.P.Seq


F


M00004118C:D12


CH01COH


960 


24759 


RTA00002903F.n.05.1.P.Seq


F


M00007082D:E05


CH02COH 


961 


15152 


RTA00002925F.g.01.1.P.Seq


F


M00039873B:H04


CH09LNL


962 


14917 


RTA00002922F.b.02.1.P.Seq


F 


M00038616D:B07 


CH09LNL 


963 


12941 


RTA00002889F.c.15.1.P.Seq


F


M00001532A:G08


CH01COH


964 


29676 


RTA00002931F.b.03.1.P.Seq


F 


M00042788A:F04 


CH16COP 


965 


17789 


RTA00002891F.a.21.1.P.Seq


F


M00001671A:H10


CH01COH


966 


45097 


RTA00002928F.g.06.1.P.Seq


F 


M00040247D:D02 


CH13EDT 


967 


18407 


RTA00002909F.b.11.1.P.Seq


F 


M00022546B:E05 


CH03MAH 


968 


22309 


RTA00002900F.n.19.1.P.Seq


F 


M00005392A:G06 


CH02COH 


969 


109382 


RTA00002907F.k.13.1.P.Seq


F 


M00022224A:G07 


CH03MAH 


970 


92273 


RTA00002909F.j.17.1.P.Seq


F 


M00022662D:H03 


CH03MAH 


971 


8403 


RTA00002915F.j.22.1.P.Seq


F 


M00032474A:G03 


CH08LNH 


972 


7763 


RTA00002928F.h.10.1.P.Seq


F


M00040267D:A12


CH13EDT 


973 


13470 


RTA00002930F.k.09.1.P.Seq


F 


M00056304A:H05 


CH15CON 


974 


1484 


RTA00002921F.k.10.1.P.Seq


F 


M00033556D:C10 


CH09LNL 


975 


10345 


RTA00002892F.o.19.2.P.Seq


F


M00003848C:G09


CH01COH


976 


17242 


RTA00002931F.a.05.1.P.Seq


F 


M00042433A:E11 


CH16COP 


977 


171180 


RTA00002909F.f.24.1.P.Seq


F


M00022618B:D09


CH03MAH 


978 


16790 


RTA00002914F.c.03.1.P.Seq


F


M00028067A:C11


CH08LNH 


979 


139516 


RTA00002903F.l.02.1.P.Seq


F


M00007052C:A12


CH02COH 


980 


4825 


RTA00002930F.b.15.1.P.Seq


F


M00042742B:E04


CH15CON


981 


8830 


RTA00002930F.a.05.1.P.Seq


F


M00042528C:H01


CH15CON 


982 


12398 


RTA00002935F.o.19.1.P.Seq


F 


M00055391B:C07 


CH17COHLV 


983 


17867 


RTA00002900F.c.14.1.P.Seq


F 


M00004850A:B02 


CH02COH 


984 


15796 


RTA00002935F.b.12.1.P.Seq


F


M00043339C:F11


CH17COHLV 


985 


185669 


RTA00002935F.f.13.1.P.Seq


F


M00054686A:A09


CH17COHLV 


986 


13638 


RTA00002935F.j.20.1.P.Seq


F 


M00055002B:E08 


CH17COHLV 


987 


8280 


RTA00002930F.e.12.1.P.Seq


F 


M00055653C:B07 


CH15CON 


988 


12632


RTA00002931F.c.03.1.P.Seq


F 


M00042860B:C07 


CH16COP 


989 


7620 


RTA00002935F.m.18.1.P.Seq


F


M00055240A:A08


CH17COHLV 


990 


23922 


RTA00002935F.m.20.1.P.Seq


F 


M00055244B:F07 


CH17COHLV 


991 


43864 


RTA00002931F.b.05.1.P.Seq


F 


M00042794A:F01 


CH16COP 


992 


34478 


RTA0O0O2929F.2. t_v l.P.Seq 


F 


M00040j6/A:C0S 


CH14EDT


993 


6861 


RTA00002933F.c.17.1.P.Seq


F 


M00043221D:C12 


CH19COP 


994 


13971 


RTA00002933F.b.01.1.P.Seq


F 


M00043099A:H04 


CH19COP 


995 


13971 


RTA00002933F.a.24.1.P.Seq


F 


M00043099A:H04 


CH19COP 


996 


13244 


RTA00002927F.c.08.1.P.Seq


F


M00039537A:F08


CH12EDT 


997 


7455 


RTA00002935F.d.11.1.P.Seq


F


M00054528B:E05


CH17COHLV 


998 


18915 


RTA00002929F.b.21.1.P.Seq


F 


M00040201A:H01 


CH14EDT 


999 


4023 


RTA00002935F.h.03.1.P.Seq


F 


M000547S6OD03 


CH17COHLV 


1000 


10785 


RTA00002933F.a.11.1.P.Seq


F 


M00043074C:D07 


CH19COP 



SEQ ID


CLUSTER 


SEQ NAME 


ORIENTATION 


CLONE ID 


LIBRARY 


1001 


14851 


RTA00002928F.i.07.1.P.Seq


F 


M000402S7CF10 


CH13EDT 


1002 


109382 


RTA00002907F.k.13.2.P.Seq


F 


M00022224A:G07 


CH03MAH


1003 


23878 


RTA00002933F.b.03.1.P.Seq


F 


M00043101D:G11 


CH19COP


1004 


27516 


RTA00002927F.U7. l.PSeq 


F 


M00039598A:E04 


CH12EDT 


1005 


9652 


RTA00002931F.c.04.1.P.Seq


F 


M00042863D:F09


CH16COP


1006 


24729 


RTA00002931F.a.12.1.P.Seq


F 


M00042462B:C02 


CH16COP


1007 


186041 


RTA00002912F.b.24. 1 .P.Seq 


F 


M00027244C:B06


CH04MAL 


1008 


12282 


RTA00002935F.i.18.1.P.Seq


F 


M00054908C:A01


CH17COHLV


1009 


10704 


RTA00002930F.f.12.1.P.Seq


F 


M00055757A:B01 


CH15CON


1010 


3397 


RTA00002930F.h.08.1.P.Seq


F 


M00055975B:F09


CH15CON


1011 


35256 


RTA00002886F.m. 16. 1 .P.Seq 


F 


M00001374A:B02 


CH01COH


1012 


1448 


RTA00002900F.g.05.1.P.Seq


F 


M00005002A:C03 


CH02COH 


1013 


1259 


RTA00002922F.n.08. 1 .P.Seq 


F 


M00039131C:B09


CH09LNL 


1014 


16903 


RTA00002935F.a.01. l.P.Seq 


F 


M00042352B:A04 


CH17COHLV


1015 


7884 


RTA00002922F.a.13.1.P.Seq


F 


M00038390B:F02 


CH09LNL 


1016 


5976 


RTA00002930F.j.11.1.P.Seq


F 


M00056220D:G02 


CH15CON


1017 


16832 


RT A00002888F.h.20. 1 .P.Seq 


F 


M00001466B:F03


CH01COH


1018 


10490 


RTA00002886F.k.07. l.P.Seq 


F 


M00001364C:H10 


CH01COH


1019 


6317 


RTA00002928F.g.10.1.P.Seq


F 


M00040252C:G05 


CH13EDT 


1020 


41215 


RTA00002897F.l.11.1.P.Seq


F 


M000042S2A:D01 


CH01COH


1021 


6844 


RTA00002889F.I.21. l.P.Seq 


F 


M00001562B:B02


CH01COH


1022 


10456


RTA00002897F.i.20.1.P.Seq


F 


M00004242D:H01 


CH01COH


1023 


2720 


RTA00002891F.i.07.1.P.Seq


F 


M00003753A:C11 


CH01COH


1024 


12473 


RTA00002888F.o.07.1.P.Seq


F 


M0000149"C:FI0 


CH01COH


1025 


15840 


RTA00002919F.n.17.1.P.Seq


F 


M0003323CCG10 


CH08LNH 


1026 


6554 


RTA00002895F.b. 16. l.P.Seq 


F 


M00004062D:A02


CH01COH


1027 


7330 


RTA00002918F.n.17.1.P.Seq


F 


M000329S"B:f01 


CH08LNH


1028 


2206 


RTA00002919F.f.12.1.P.Seq


F 


M00033071C:G05


CH08LNH 


1029 


42705 


RTA00002935F.f.02. l.P.Seq 


F 


M00054643D:F07 


CH17COHLV


1030 


33865 


RTA00002930F.b.07. l.P.Seq 


F 


M0004272:C:C09 


CH15CON


1031 


5196 


RTA00002925F.o.19.1.P.Seq


F 


M00040034B:G02


CH09LNL 


1032 


8087 


RTA00002935F.o.10.1.P.Seq


F 


M00055375C:F12


CH17COHLV


1033 


20072 


RTA00002935F.g.06.1.P.Seq


F 


M00054744C:F12 


CH17COHLV


1034 


12797 


RTA00002935F.g.24.1.P.Seq


F 


M000547S1D:A11 


CH17COHLV


1035 


3207 


RTA00002930F.c.04.1.P.Seq


F 


M00054793B:A06


CH15CON


1036 


19600 


RTA00002929F.f.24.1.P.Seq


F 


M00040351D:G07


CH14EDT 


1037 


6278 


RT A00002935 Ff. IS. l.P.Seq 


F 


M00055001C:G10


CH17COHLV


1038 


19363 


RTA00002927F.e. 12. 1 P.Seq 


F


M00039564D:D04 


CH12EDT 


1039 


15447 


RTA00002929F.g.12.1.P.Seq


F 


M000403663:HIO 


CH14EDT 


1040 


9676 


RTA00002932F.a. 14. 1 .P.Seq 


F 


M00042960B:C06 


CH18CON


1041 


12560 


RTA00002929F.h.06. 1 P.Seq 


F 


M000403S1A:B06 


CH14EDT 


1042 


12727 


RTA00002933F.c.15.1.P.Seq


F 


M00043219C:C02 


CH19COP


1043 


27475 


RTA00002914F.c.16.1.P.Seq


F 


M00028070D:C03


CH08LNH


1044 


30646 


RTA00002908F.f.11.1.P.Seq


F 


M00022416D:D01 


CH03MAH 


1045 


455S5 


RTA00002925F.h.20.1.P.Seq


F 


M00039894C:D09


CH09LNL 


1046 


25025 


RTA00002925F.e. 18. 1 P.Seq 


F 


M00039SG03:EOl 


CH09LNL 


1047 


15715 


RTA00002919F.p.05.1.P.Seq


F 


M00033274DP03 


CH08LNH


1048 


381S5 


RTA00002926F.c.07.2.P.Seq


F 


MO0O4OO7SA:CO7 


CH09LNL 


1049 


83S4 


RTA00002903F.o.13.1.P.Seq


F 


MO0O07h:D:DO3 


CH02COH 


1050 


8843 


RTA00002917F.h.17.1.P.Seq


F 


M00032733B:F12 


CH08LNH


1051 


21401 


RTA00002930F.e.20.2. P.Seq 


F 


M00055676A:G02 


CH15CON 


1052 


14434 


RTA00002903F.l.01.1.P.Seq


F 


M00007032A:B05


CH02COH 


1053 


40045 


RTA00002891F.k.14.1.P.Seq


F 


M00003764A:H09 


CH01COH 


1054 


21853 


RTA00002896F.g.12.1.P.Seq


F 


M00004lD9C:D10 


CH01COH 


1055 


23439 


RTA00002935F.p.01.1.P.Seq


F 


M00055402A:H01


CH17COHLV 


1056 


13060 


RTA00002934F.a. 19. l.P.Seq 


F 


M00043529A:B08 


CH20COHLV 


1057 


23439 


RTA00002935F.o.24.1.P.Seq


F 


M00055402A:H01


CH17COHLV 


1058 


20547 


RTA00002931F.b.12.1.P.Seq


F 


M00042822A:H04


CH16COP 


1059 


4319 


RTA00002930F.a.03.1.P.Seq


F 


M00042525B:H01


CH15CON


1060 


21430 


RTA00002901F.c.18.1.P.Seq


F 


M00005452B:G03 


CH02COH 


1061 


7668 


RTA00002935F.g.23.1.P.Seq


F 


M000547S1B:H04 


CH17COHLV 


1062 


16239 


RTA00002935F.k. 19.1 .P.Seq 


F 


M00055081A:A05 


CH17COHLV 


1063 


5631 


RTA00002929F.e. 16. l.P.Seq 


F 


M00040326B:G09 


CH14EDT 


1064 


18362 


RTA00002928F.a.04. 1 .P.Seq 


F 


M00039739B:H12 


CH13EDT 


1065 


8034 


RTA00002932F.a.22. l.P.Seq 


F 


M00042982D:A10 


CH18CON 


1066 


12497 


RTA00002928F.a. 19. 1 .P.Seq 


F 


M00040132A:H09 


CH13EDT 


1067 


21001 


RTA00002932F.b.07. 1 .P.Seq 


F 


M00042996B:H08 


CH18CON 


1068 


471 


RTA00002927F.a. 1 1. l.P.Seq 


F 


M00039184D:H09 


CH12EDT 


1069 


10003 


RTA00002897F.b.13.1.P.Seq


F 


M00004215B:C05 


CH01COH 


1070 


16074 


RTA00002935F. f. 1 8. l.P.Seq 


F 


M0005470SC:B06 


CH17COHLV 


1071 


13698 


RTA00002902F.l.01.1.P.Seq


F 


M00006743A:H11


CH02COH 


1072 


24819 


RTA00002922F.j.03.1.P.Seq


F 


M0003907SB:B03 


CH09LNL 


1073 


21511 


RTA00002892F.i.01.1.P.Seq


F 


M00003821C:E12


CH01COH 


1074 


12402 


RT A00002929F.d. 15.1 .P.Seq 


F 


M000403UB:D07 


CH14EDT 


1075 


142755 


RTA00002903F.p.06. 1 -P.Seq 


F 


M00007126A:A02


CH02COH 


1076 


3010 


RTA00002935F.p.12.1.P.Seq


F 


M00055423C:G12


CH17COHLV 


1077 


17173 


RTA00002935F.k.09. l.P.Seq 


F 


M00055043B:H08 


CH17COHLV 


1078 


2969 


RTA00002933F.a. 12. l.P.Seq 


F 


M00043076D:A02 


CH19COP 


1079 


19600 


RTA00002929F.g.01. l .P.Seq 


F 


M00040351D:G07


CH14EDT 


1080 


8542 


RTA00002927F.h.24. l.P.Seq 


F 


M0003964~A:A02 


CH12EDT 


1081 


24795 


RTA00002927FX 10. l.P.Seq 


F 


M00039594C:B06


CH12EDT 


1082 


19695 


RTA00002927F.L03. 1 .P.Seq 


F 


M0003964"B:A02 


CH12EDT 


1083 


8542 


RT A00002927F. t .0 1 . 1 .P.Seq 


F 


M0003964"A:A02 


CH12EDT 


1084 


21409 


RTA00002902F.e.09. l.P.Seq 


F 


M00006601D:G05 


CH02COH 


1085 


186318 


RTA00002912F.h.07.1.P.Seq


F 


M00027363D:G04 


CH04MAL 


1086 


7379 


RTA00002901F.e.10.1.P.Seq


F 


M0000546^D:C0l 


CH02COH 


1087 


91285 


RTA00002909F.l.12.1.P.Seq


F 


M00022682D:A10


CH03MAH


1088 


3285 


RTA00O02903F.m. 1 8. 1 P.Seq 


F 


M000070o4D:D12 


CH02COH 


1089 


6284 


RTA00002896F.g.22.1.P.Seq


F 


M00004loOD:G05 


CH01COH 


1090 


15676 


RTA00002935F.h.2 1. 1 .P.Seq 


F 


M00054843A:C01


CH17COHLV 


1091 


34112 


RTA00002894F.o.19.1.P.Seq


F 


M00004053D:F09 


CH01COH 


1092 


16407 


RTA00002892F.o.24.2.P.Seq


F 


M00003851B:A01


CH01COH


1093 


919 


RTA00002S90F.f. IS. l.P.Seq 


F 


M0000160v>D:CIl 


CH01COH 


1094 


59069 


RTA00002896F.k.07.1.P.Seq


F 


M00004175D:E06 


CH01COH 


1095 


31167 


RTA00002900F.f.05.1.P.Seq


F 


M00004S7cB:A06 


CH02COH 


1096 


23873 


RTA00002930F.i.03.1.P.Seq


F 


M00O56035D:A0S 


CH15CON 


1097 


15679 


RTA00002900F.h. 1 2. l.P.Seq 


F 


M000050lcC:E04 


CH02COH 


1098


31852 


RTA00002911F.o.22.1.P.Seq


F 


MO0027rOD:C07 


CH04MAL 


1099 


39030 


RTA00002891F.f.07.1.P.Seq


F 


M0000169:C:C04 


CH01COH 


1100


16407 


RTA00002892F.p.01.2.P.Seq


F 


M00003851B:A01


CH01COH 


1101


4118 


RTA00002935F.a. 17. 1 .P.Seq 


F 


M00043306D:C01


CH17COHLV 


1102 


11054 


RTA00002915F.n.07.2.P.Seq


F 


M00032504B:B10


CH08LNH 


1103 


186640 


RTA00002918F.d.03.1.P.Seq


F 


M0003284SD:B10 


CH08LNH


1104


9301 


RTA00002887F.f.22.1.P.Seq


F 


M00001396D:H02


CH01COH 


1105 


13438 


RTA00002901F.a. 1 1. 1. P.Seq 


F 


M00005422D:H10 


CH02COH 


1106


23691 


RTA00002901F.g.21.1.P.Seq


F 


M00005506C:E09


CH02COH 


1107 


32788 


RTA00002909F.m.04. 1 .P.Seq 


F 


M00022694A:F05


CH03MAH 


1108 


34364 


RTA00002915F.o.09.2.P.Seq


F 


M00032515A:B12 


CH08LNH 


1109 


24840 


RTA00002908F.L01. l.RSeq 


F 


M00022452B:E06 


CH03MAH 


1110 


3416 


RTA00002911F.j.17.1.P.Seq


F 


M00027036A:B06 


CH04MAL 


1111


16889 


RTA00002930F.1.04. 1 .P.Seq 


F 


M00055724B:E04 


CH15CON 


1112 


2159 


RTA00002929F.f.15.1.P.Seq


F 


M00040344C:D05 


CH14EDT 


1113 


8880 


RTA00002929F.f.02. 1 .P.Seq 


F 


M00040338A:B10 


CH14EDT


1114 


10722 


RTA00002921F.o.22.1.P.Seq


F 


M00038304B:E02 


CH09LNL 


1115 


15046 


RTA00002887F.c.08.1.P.Seq


F 


M00001389B:E10


CH01COH


1116 


13868 


RTA00002898F.B.18. l.RSeq 


F 


M00004347B:E04


CH01COH 


1117 


4226 


RTA00002925F.f. 15. l.P.Seq 


F 


M00039869A:H01 


CH09LNL 


1118 


90435 


RTA00002909F.l.06.1.P.Seq


F 


M00022678B:C08 


CH03MAH 


1119 


25686 


RTA00002900F.I.02.1. P.Seq 


F 


M00005359B:D09 


CH02COH 


1120 


7296 


RTA00002900F.d.08. 1. P.Seq 


F 


M00004856D:F09 


CH02COH 


1121 


11546 


RTA00002905F.o.16.1.P.Seq


F 


M00021678A:H03


CH03MAH 


1122 


15748 


RTA00002901F.h.12.1.P.Seq


F 


M00005515D:F02 


CH02COH 


1123 


5591 


RTA00002903F.p.20. 1 .P.Seq 


F 


M00007141C:B05 


CH02COH 


1124 


9433 


RTA00002935F.p. 14. 1 .P.Seq 


F 


M00055424B:H06 


CH17COHLV


1125 


9654 


RTA00002924F.i.09. 1 .P.Seq 


F 


M00039654C:C11


CH09LNL 


1126 


21914 


RTAO00028S9F.LO6. 1 .P.Seq 


F 


M00001550A:H06 


CH01COH


1127 


4277 


RTA00002927F.h. 13. 1. P.Seq 


F 


M00039642A:A08 


CH12EDT 


1128 


12362 


RTA00002929F.h.24. 1 .P.Seq 


F 


M00040391A:G05 


CH14EDT 


1129 


449 


RTA00002924F.e.22.1.P.Seq


F 


M00039471D:G10 


CH09LNL 


1130 


1820 


RTA00002922F.n.11.1.P.Seq


F 


M00039133C:F12 


CH09LNL 


1131 


12159 


RTA00002930F.b.24. 1 .P.Seq 


F 


M00042894C:A11


CH15CON


1132 


25106 


RTA00002903F.d.2 1 . 1 .P.Seq 


F 


M00006907B:C06 


CH02COH 


1133 


2245 


RTA00002917F.b.20.1.P.Seq


F 


M00032676C:C10 


CH08LNH 


1134 


14388 


RTA00002894F.h.09.1. P.Seq 


F 


M0000399SB:G10 


CH01COH


1135 


12219 


RTA00002898F.d.22.1.P.Seq


F 


M0000432SA:D01 


CH01COH


1136 


4726 


RTA00002935F.d. 16. 1 .P.Seq 


F 


M0005453SD:C12 


CH17COHLV


1137 


19479 


RTA00002891F.e.15.1.P.Seq


F 


M0000168SB:B11 


CH01COH


1138 


13280 


RT A00002 88 8 F. h .OS . 1 . P.Seq 


F 


M00001465C:A02


CH01COH


1139 


4270S 


RTA00002901F.h.07.1.P.Seq


F 


M00005511A:F05 


CH02COH 


1140 


2022 


RTA00002896F.f.09.1.P.Seq


F 


M00004155C:A10 


CH01COH


1141 


7281 


RTA00002929F.f.22. 1 .P.Seq 


F 


M00040351A:COS 


CH14EDT


1142 


3241 


RTA00002919F.n.09.1.P.Seq 


F 


M00033223C:G04 


CH08LNH


1143 


16161 


RTA00002930F.h.02.1. P.Seq 


F 


M00055925D:B07 


CH15CON


1144 


2766 


RTA00002935F.p.21.1.P.Seq


F 


M0005547^D:B01 


CH17COHLV


1145 


11175 


RTA00002886F.b.05.1.P.Seq


F 


M00001340D:F07 


CH01COH


1146 


7223 


RTA00002923F.i.06.l. P.Seq 


F 


MO003927SC:DO3 


CH09LNL 


1147 


6786 


RTA00002917F.o.05.1.P.Seq


F 


M0003279:C:B01 


CH08LNH


1148 


186651 


RTA00002923F.d.21.1.P.Seq


F 


M00O39219B:C0S 


CH09LNL 


1149 


7878 


RTA00002930F.f.05.1.P.Seq


F 


M00055724D:C07 


CH15CON


1150 


12624 


RTA00002935F.p.11.1.P.Seq


F 


M0005542?A:B0S 


CH17COHLV


1151 


23018 


RTA00002886F.f.21.1.P.Seq


F 


M00001 i B.El I 


CH01COH 


1152 


186756 


RTA00002911F.g.23.1.P.Seq


F 


M0002696! A:B06 


CH04MAL 


1153 


4177 


RTA00002902F.h.07. l.P.Seq 


F 


M000066 .■<A:A03 


CH02COH 


1154 


10430 


RT A00002894F.g.2 1. l.P.Seq 


F 


M00003996B:H07 


CH01COH 


1155 


31280 


RTA00002903F.k.08. 1 .P.Seq 


F 


M0000700. A:E04 


CH02COH 


1156 


19098 


RT A00002925F.e.23. 1 .P.Seq 


F 


M00039861C:B12 


CH09LNL 


1157 


24105 


RT A00002932F.a.06. 1 .P.Seq 


F 


M00042585D:E10


CH18CON


1158 


7750 


RTA00002935F.i. 02. l.P.Seq 


F 


M000348^6C:D03 


CH17COHLV 


1159 


14582 


RTA00002898F.d.07. l.P.Seq 


F 


M0000432-A:D05 


CH01COH 


1160 


21356 


RTA00002917F.j.19.1.P.Seq


F 


M000327^jA:C07 


CH08LNH


1161 


16210 


RTA00002930F.k. 17. l.P.Seq 


F 


M00036j4r D:A04 


CH15CON


1162 


2012 


RTA00002911F.o.10.1.P.Seq


F 


M00027bvC:F07 


CH04MAL


1163 


5391 


RT A00002909F.p.2 1 . 1 .P.Seq 


F 


M000227j3D:G08 


CH03MAH


1164 


10172 


RTA00002886F.a.05. l.P.Seq 


F 


M000O13j?3C:BO2 


CH01COH 


1165 


16403 


RTA00002935F.p. 15.1 .P.Seq 


F 


M0O05542-:D:GO5 


CH17COHLV 


1166 


21920 


RTA00002886F.j.05.1.P.Seq


F 


M00001361A:C12


CH01COH 


1167 


7070 


RTA00002921F.c.06.1.P.Seq


F 


M0003337-;D:C07 


CH09LNL 


1168 


45734 


RTA00002901F.j.14.1.P.Seq


F 


M00005569D:G09


CH02COH 


1169 


12362 


RTA00002929F.i.01.1.P.Seq


F 


M00040391A:G05


CH14EDT 


1170 


9405 


RTA00002892F.k.04.1.P.Seq


F 


M00003830C:D02


CH01COH 


1171 


6507 


RTA00002922F.o.05.1.P.Seq


F 


M00039140A:F05 


CH09LNL 


1172 


10735 


RTA00002925F.b.24. l.P.Seq 


F 


M00039822A:H02


CH09LNL 


1173 


21177 


RTA00002935F.d.18.1.P.Seq


F 


M00054542B:A10 


CH17COHLV 


1174 


14950 


RTA00002894F.m. 18. l.P.Seq 


F 


M0000404"D:F12 


CH01COH


1175 


10762 


RTA00002917F.o.08.1.P.Seq


F 


M00032793A:G06


CH08LNH


1176 


23170 


RTA00002887F.f.15.1.P.Seq


F 


M00001396B:B01


CH01COH 


1177 


8487 


RTA00002887F.f. 16. 1. P.Seq 


F 


M00001396B:B12 


CH01COH 


1178 


185798 


RTA00002911F.k.06.1.P.Seq


F 


M00027050A:B02 


CH04MAL


1179 


8976 


RTA00002896F.h.03.1.P.Seq


F 


M00004161B:G07


CH01COH


1180 


12159 


RTA00002930F.c.01.1.P.Seq


F 


M00042894C:A11


CH15CON


1181 


7788 


RT A00002932F.b. 13. 1 P.Seq 


F 


M000430l"C:D08 


CH18CON 


1182 


43336 


RT A000029 1 7F.d.09. 1 .P.Seq 


F 


M000326SSC:A03 


CH08LNH 


1183 


10313 


RT A00002902F.k. 19. l.P.Seq 


F 


M000067-i0B:A09 


CH02COH 


1184 


4588


RTA00002891F.o.11.1.P.Seq


F 


M000037S2A:B02 


CH01COH


1185 


18090 


RTA00002925F.l.17.1.P.Seq


F


M000399S1D:B01 


CH09LNL 


1186 


185994 


RTA00002911F.p.07.1.P.Seq


F 


M00027177B:D04 


CH04MAL


1187 


166276 


RTA00002908F.h.03. l.P.Seq 


F 


M0002243SC:H09 


CH03MAH 


1188 


15984 


RTA00002932F.a. 10. l.P.Seq 


F 


M000426:iC:C04 


CH18CON 


1189 


13242 


RTA00002889F.i.11.1.P.Seq


F 


M00001550D:B11


CH01COH


1190 


6840 


RTA00002935F.i.06.1.P.Seq


F 


M00054Sc6B:C03 


CH17COHLV


1191 


17265 


RTA00002935F.l.04.1.P.Seq


F 


M0005510SB-.A02 


CH17COHLV


1192 


12542 


RTA00002933F.b.17.1.P.Seq


F 


M0004j 1^_C:B 10 


CH19COP


1193 


1568 


RTA00002928F.d.10.1.P.Seq


F 


M00040r4D:G06 


CH13EDT 


1194 


8721 


RTA00002901F.k.23.1.P.Seq


F 


M00005606D:B12 


CH02COH 


1195 


13519 


RTA00002898F.j.19.1.P.Seq


F 


M000043oSA:Bll 


CH01COH


1196 


4471 


RTA00002890F.d.14.1.P.Seq


F 


MO000L60OB:GOI 


CH01COH


1197 


11357 


RTA00002931F.c.06.1.P.Seq


F 


M00042S'SD:F05 


CH16COP 


1198 


11804


RTA00002935F.m. 17. 1 .P.Seq 


F 


M00055:39D:Fn 


CH17COHLV


1199 


6999 


RTA00002896F.c.21.1.P.Seq


F 


M00004U6A:C11 


CH01COH


1200 


4408 


RTA00002897F.a.02.1.P.Seq


F 


M0000420"C:A04 


CH01COH


1201


4618 


RT A0000°935F i 'M 1 P Sea 


F 


MOOOSSOfLrC-HOS 


CH17COHLV


1202


185841 


RTA0O0O°9PF 0 08 1 PSea 


F 


M000'?76O0R*CO7 




1203 


1278 


RTAOOOO^PF i 15 1 PSea 


F 


M000'>74^9rR 10 




1204 


19677 


RTA00002929F.h.16.1.P.Seq


F 


M000403 8 4 R ■ F04 


CH14EDT


1205 


17539 


RTA00002909F.c.06.1.P.Seq


F 


M000'>'>56" ; B-rO8 


CH03MAH 


1206 


11390


RTA00002892F.k.03.1.P.Seq


F 


M00003830B:C06


CH01COH 


1207 


10735 


RTA00002925F.c.01.1.P.Seq


F 


M00039822A:H02


CH09LNL 


1208 


3039 


RTA00002887F.i.07.1.P.Seq


F 


M00001406D:F06


CH01COH


1209 


181718 


RTA00002905F.n.20.1.P.Seq


F 


M00021668D:A03


CH03MAH 


1210


6957 


RTA00002917F.n.18.1.P.Seq


F 


M00032787D:C05


CH08LNH 


1211


23673 


RTA00002930F.a.14.1.P.Seq


F 


M00042557D:B06


CH15CON


1212 


11405 


RTA00002918F.b.13.1.P.Seq


F 


M0003^8" 1 AE09 


CH08LNH


1213 


10^56 


RTA00002888F.d.16.1.P.Seq


F 


M00001449B:H10


CH01COH


1214 


25563 


RTA00002891F.b.23.1.P.Seq


F 


M00001675B:D06


CH01COH 


1215 


2669 


RTA00002886F.l.03.1.P.Seq


F 


M00001368A:B07


CH01COH


1216 


185877 


RTA00002911F.b.11.2.P.Seq


F 


MOOO'? 3389A* G04 


CH04MAL 


1217


13 14 


RTA00002930F.c.08.1.P.Seq


F 


M00054911D:F06


CH15CON


1218


25843 


RTA00002908F.o.05.1.P.Seq


F 


M000^^509 A ■ H0° 


CH03MAH


1219


1794 


RT A00007074F o 06 I P Sen 


F 


\I000395f\0r H06 


CH09LNL


1220


2^038 


RTA00002896F.a.20.1.P.Seq


F 


M00004 1 36r - r n 


CH01COH


1221


vU 1 1 


RT A0000°9'>4F f P IP Sea 


F 


M00039478r-R0^ 


CH09LNL


1222


41087


RTA00002901F.n.06.1.P.Seq


F 


M0000567 s n ■ D09 


CH02COH


1223


18534 


RT A0O0O">90SF o 16 I P Sea 


F 


M00O7O SPR- a 09 


CH03MAH 


1224


1444


RT AOOOO^^F a P 1 P Sea 


F 


^000383890*0 10 


CH09LNL


1225


1078 


RT AOOOO°893F IP Sea 


F 


M0000397^r-F07 


CH01COH


1226


8632 


RTA00002896F.h.09.1.P.Seq


F 


M00004 1 6 " R * CO i 


CH01COH


1227


10504° 


RTA00002917F.o.11.1.P.Seq


F 


M0003779-SC' A 03 


CH08LNH


1228 


6878 


RTA0000 n 9PF 11 I I P Sea 


F 


M000^75 1 "O*F06 


CH04MAL


1229


— j\j j 7 


RTA00002910F.i.05.1.P.Seq


F


M000°7 ^8 1 R R04 


CH04MAL


1230


19635 


RT A0000">S96F c IS 1 P Sen 


F 


M00004 I44O-R0"* 


CH01COH


1231


7° 17 


RT A0000 n 9'?6F d I S ~> P Sen 


F


M00040094 R P08 


CH09LNL 


1232 


4910 


RTA00002930F.g.04.1.P.Seq


F 


MOOOS58 1 0P-O03 


CH15CON


1233 


16945 


RT A0000 n 9' 7 IF o 01 1 PSea 


F 


M0003S"*90A-D I 


CH09LNL


1234


24790 


RT A0000°890F e ^ I I P Sea 


F 


M0000 1 606 O * D06 


CH01COH


1235 


9079 I 


RTA00002932F.a.07.1.P.Seq


F 


M0004' ) 586 A ■ B 0 1 


CH18CON


1236 


14861 


RT AO000°9OIF o IS I PSea 


F 


M00005505B:E01


CH02COH


1237 


°45'> 


RT AOOOO"^ 1 F b 0 n IP Sea 


F 


M0003330^BF 10 


CH09LNL 


1238


19*>69 


RTA00002887F.o.11.1.P.Seq


F 


M00001430B:C01


CH01COH


1239


160^9 


RT A0O0O^93OF i 17 1 P Sea 


F 


M00056"»44AB06 


CH15CON 


1240


3038 


RT A0000^9 no F m 04 I PSea 


F 


M00039 P I D-F07 


CH09LNL 


1241




RT A0O00" , 9'>' ) F a^O 1 P Sea 


F 


VI 000 1 90 5 6 R Pi 0 1 


CH09LNL 


1242 


15956 


RTA00002891F.i.23.1.P.Seq


F 


M00003761BB0* > 


CH01COH


1243 


15524 


RTA00002930F.g.09.1.P.Seq


F 


M0005581SB:D01 


CH15CON


1244 


21550 


RTA00002935F.f.21.1.P.Seq


F 


M00054725C:D09


CH17COHLV


1245 


17567 


RTA00002918F.p.11.1.P.Seq


F 


M00033006A:F10 


CH08LNH


1246 


20293 


RTA00002888F.j.20.1.P.Seq


F 


M00001477D:G09


CH01COH


1247 


9520 


RTA00002927F.h.15.1.P.Seq


F 


M00039642C:F0S 


CH12EDT 


1248 


2700 


RTA00002889F.e.21.1.P.Seq


F 


M00001539C:F12 


CH01COH


1249 


25S91 


RTA00002909F.p.23.1.P.Seq


F 


M00022740C:K11 


CH03MAH 


1250 


4298 


RTA00002908F.c.22.1.P.Seq


F 


M00O223S3C:A!2 


CH03MAH


1 9-\ 1 


-U4- 1 _ 


DT A nAAfPQflQP a fi7 t P N*.»r» 


c 
r 


Mnno9 9 s on r . Ffis 

IV1UUU— — J7UD . JuUJ 


p wn7Vff a w 


i7<7 


7Q/I 1 7 

zy4L J 


D"T \ fWYYXinAF m 77 1 p C^ri 

K 1 AUUUU_yuor.iTi._ j. i .r.ocq 


r 


1VIUUW— — UU7 IJ.\„ i — 


pLJn7V,l A u 




1 97 1 x 


DT A nnnA7Qfl7P 1 7,1 7 p C M 


F 
r 


iVIUUU-i — — 4UL/, Oil 


L, riU J lvl ATI 


1 7 Sil 




K I Auuuuzy jur. 2. U**._.r.ocq 


F 
r 


IVIUUUJ JO 1UV_ . L^UJ 


ru i <pp»M 


19^ 


nn i q 


K 1 .-\UUUU-7-4r.C. ID. 1 .r . Ocq 


F 
r 


1V1UUU J 7-t _ 0 D . L/UO 


PHOOI NTT 


1 9SA 

I -JO 


IUjUI 


PTArinnn7Q';nF h 77 1 p s*.»n 
K 1 .-\uuuu-7Jur.n.ij. i.i.jeq 


F 






I9S7 

L £ J / 


1 1 7 1 J. 


dt Annnn7Q7' ; \P tr fn 1 p 

k. i .Auuuu-7jjr.n.wj. 1 .r.jct] 


F 


Moons so" 1 " a - f 1 1 


PH17POMT V 


I .Jo 




DTAfinnn9Q97F h IS I P 9po 
r\ i 7i / r.u. i j. 1 .r.ocq 


F 
r 


Mnnn7 o ^ ~ ~ r • woo 




|7nQ 

I -J7 




rt Arinnn7QfuF rn 1 p 

IX 1 .AUwl)i7Lrrri -VJ. 1 .1 . JCLJ 


F 




PHO^POH 


hah 


AQG 1 


DT A Pinnn7Q77F i 19 1 P <\*n 


F 
r 


iVl\JVUJVvyo 1 D-V-UH 


PHOOJ NIT 


1 .0 1 


1 iyZo 


dt AfinnrnofiAF w ns i p 


F 


IV1UUU— IV 1L.O 1 1 


PHO^M AH 
Lnuj iv irvn 


l^A") 
l-^OZ 


79.77A 
ZiSZZO 


dt a fwwnornF n 7fl I P C^n 


F 
r 


iV/fn/vyjT'^ a** R • RDA 

iVlUUUZZ — D_ O DUO 


P Hf)"; .Vf A H 
v_ nu J ivi.-Mi 


19A"* 
1 -ID J 


1 An<Q 


DTAnnnn9Q7SF ; 17 i p 

tv l AUUUUZVJjr.J. 1 j. 1 .r.jci] 


F 


iviuuujhv ov_.ru i 


PR I7POHT V 


1 9AJ. 


77^7 
ZZjZ 


d t a nnnn9 q q af l- 74 1 p *z*n 


F 


Mnnnn 1 a • ahs 

IV1UUUU 1 JOo.A. AUO 


PHD1 POH 


IZOj 


/in co 


dt a nnnn7Q7CF no 1 d 


c 
r 


\yfnnn</i7 1 in -Pin 

IVlUUUj** / l-*l5.01U 


PI4 1 7Pni47 V 
Lnl / v_.VJnJL. v 


1 7£A 
1Z0O 


7 1 IOC 

Zl fyj 


K 1 AUUUUZVU 1 r.D. 10. 1 .r . jCQ 


c 
r 


\yf nnnn> i p a dih 

1V1UUUUJ-+'*- A.D 1U 


PH07POH 


1 7A7 
1ZO / 


13U4y 


dt a nrinn"t07^F ; in i d c*>n 
K 1 .-\UUUUj.VJjr.J. 1U. 1 .r.ocq 


c 
r 


Kjfnnn^jO"'^ R-F 
iviuuuj**y . jd.jci- 


PH I7PORT V 


1 7AQ 
IZOo 


jjOj 


dt a nnnn7Q7np o n7 i p <\^n 


c 
r 


iv/fnnri'iTon x a fho 
iviuuu-*— vuo a. ruv 


PH 1 SPOiNJ 


1Z0V 


ZU4V j 


K 1 AUUUUZvjjr .a. 14. l.r.oeq 


r 


N/fnnn^i 7n — p r\ 1 7 


r u mpop 


1 77n 

iz /u 


ZUZj / 


K 1 AUUUUZyj4r.a. 14. l.r.oeq 


F 


iVlUUU4 j4yrL_ . HUj 


PFT9nPn(4T V 


IZ / 1 


lojyz 


K I AUUUUZoVvr.a.Uj. 1 .r.oeq 


r 


mUUUU4-+ 1 n o . A 1 1 


pwnt rnn 
v_riu i LL/n 


ill I 


0797 


dt \ nnnmrnnc T7 i d c^s* 
K 1 AUuUU^vjUr.c.Zj. 1 r.oeq 


r 


\ / Ann; r ic 'r~.UA/C 
MUUUJ D 4 ? c C . rlUO 


PH 1 ^PONI 


IZ / J 


Loll 


DT v AAHAO OO 1 F ^ 7 1 ID C»*-i 

K 1 AUUUUZoVir.a.zi.i.r.jeq 


n 
r 


iviuuuu 1 Oc-UXW 


PHOt POH 
v_ nu i v_vjn 


LZ /4 


17j0j 


R I AOUUUZvjjr.r.Uo. l.r.beq 


r 


MUUUj40 ?U. UUj 


L.rl 1 / LUrlLV 


l 77< 
IZ/J 


140JV 


DT \ AriAA7Qr\<C k 17 1 D Cj»^ 

K I AUUUUZVUjr.n. 1Z. l.r.oeq 


r 


\>innnnQn"* * Pi nni 

IVlUUUUoU. _ U.UUl 


PHn7\-l A W 

L. nUJ:*I.Tjl 


I 77A 
IZ /O 


y i4o 


K I AUUUUZVj4r.a. 1U. 1 .r.oeq 


r 


MUUU4 J4CT L-.n 1 I 


L- rl^U^VJiLL V 


I 777 
lift 


I A£QO 

luooy 


K 1 AUUUUZVjUr.n. IV. l.r.oeq 


F 

r 


ivlUUU J OUL — £5 . V- UJ 


PH 1 SPONf 


I 77Q 
IZ /O 


I 1 <OA 

I I JVC 


dt a nnnrnQonF ^ 77 \ p 

K 1 AUUUU_oVUr.CZ J. 1 .r .oeq 


c 
r 


iVIUUUU 1 OL* A . CLH 


PH01POH 


I 77G 


7777 1 

Zj /j 1 


K I AUUUU-V Jur.2. 1 3. 1 .r .ofiq 


c: 
r 


IVIUUUJ jo - l-/ A_ U^. 


PH 1 ^PON 


IZoU 


7< A 70 

Zj4Zy 


K t AUUUUZVjUr.n. 1U. t .r.oeq 


r 


IVIUUUJ J7CV. V_ . DU4 


PH 1 SPON 
v_ n i jv^ ui™ 


I 70! 
IZo L 


10 1U 


dt \ nnni^ioi i c k o>i i d c»n 
K 1 AUUUUZyj lr.D.Z4. l.r.^eq 


r 




PH 1 APOP 


I 7Q7 

1 -Oi 


1 1 /o 


dt Annnn7Q7^F i in i d Q^r> 
K 1 AUUUUzyjjr.a. 1U. 1 .r .oeq 


r 


a ,f Ann i° a < ~ p ■ n nA 

£vlUUU*+^.-+_ v_ DUO 


PHI 7Pnf4T V 


1 7Q7 
IZoJ 


ZjD78 


dt \ nnnmmnc ^ 77 t d Cja 

k 1 Auuuuzy jUr.a.zz. i .r.oeq 


r 


\ .f AAA 17 < — C \ PitlQ 
IviUUU4_j - A-UUV 


PH 1 SPON 


1 7 0,1 

!Zo4 


I7Zj8 


R 1 AUUUUZyjZr.a. lo. 1 .r.beq 


r 


\a nnn 1 7Q —i v - q i 
MUUU4Zy . .DUI 


ph i crnv 

Ln loL Kjy* 


not 

IZoJ 


1610 


d t \ nnnmm t n* « n t id Cm 
R I AUUUUZyj Ir.c.Ul. 1 r.oeq 


r 


X/fnnn 17 c>vp-p i \ 
MUUU4Zo_ , ^L. 0 1 1 


PH I APOP 
\_tl iuk_ vjr 


IZoO 


1 nice 
loJoo 


dt \nnnmo77F i n/i l d c Jn 
R I AUUUUZyjZr.a.U4. 1 .r. ->eq 


r 


\/t nnn J7 <c; \ ui i 
IvlUUU**- jo? A.ri 1 1 


PH 1 nPON* 
Lnl OV_ \J.v 


IZo / 


19/09 


R I AUUUUZyjZr.a. IZ. 1 r i>eq 


F 

r 


\vfnnn t7Qc * rvp 17 

IvlUUUWZV- . ^J.Li l - 


PH 1 ^PO\* 


1 7 QQ 

IZoo 


1 IUZ / 


d t \ nnnmontr k i n t d 

R 1 AUUUU_y JZr.D. 1U. 1 r ,beq 


c 
r 


V/fnnn-iin! ts.pn* 
MUUU4jU I _*D. cUj 


PH 1 x'PON' 

Lnl uL KJ:* 


t 7QG 


2j4j I 


dt \ fvmn^onc o. no i d Ci/i 
R I AUUUU_yjjr.a.uy. 1 r oeq 


r 


. MUUU4_0 i D.tUl 


ph i qphp 


1 90n 


7 177 I 
Z J / 3 1 


dt a nnnn^oinF « iq i p 

R 1 AUUUU_yjUr.2. lo-Z.r.5Cq 


r 


iVIUUU J JO. .* u . L U- 


PH 1 SPO\* 


1 7Q 1 
IZy 1 


.it on o 


d t a nnnn^o i t f l- n7 l p 
R i auuuu_vi ir.K.u/ . i .i -ocq 


p 
r 


iVIUUU— / U_ _ A . C 1 U 




1292


J.J O 1 


DT AOOnn^Oi^F h 09 l P Sen 


F 


M000429ScD:E03 


CH18CON


1293 


42 


RTA00002934F.a. 1 1.1 .P.Seq 


F 


M000434V A:C10 


CH20COHLV


1294 


1447 


RTA00002935F.d.14.1.P.Seq


F 


M0005453c3:B01 


CH17COHLV


1295 


10449 


RTA00002935F.o.20.1.P.Seq


F 


M0005539:DDll 


CH17COHLV


1296 


35359 


RTA00002935F.h.11.1.P.Seq


F 


M00054SL"O.All 


CH17COHLV


1297 


19657 


RTA00002935F.l.20.1.P.Seq


F 


M000551ccC:DlO 


CH17COHLV


1298 


12659 


RTA00002930F.i.21.1.P.Seq


F 


M00056133A:E11


CH15CON


1299 


9081 


RTA00002934F.a.22.1.P.Seq


F 


M000436ZCA:B01 


CH20COHLV


1300 


170S4 


RTA00002935F.a.14.1.P.Seq


F


M000425:C3:H04


CH17COHLV


1301 


11972 


RTA00002935F.b.10.1.P.Seq


F 


M00043336D:B03 


CH17COHLV


1302 


11077 


RTA00002935F.c.12.1.P.Seq


F 


M00043402B:G07 


CH17COHLV


1303 


126414 


RTA00002885F.a.01.1.P.Seq


F 


M00042350A:A05 


CH16COP


1304 


113291 


RTA00002935F.m.15.1.P.Seq


F 


M00055252A:E08 


CH17COHLV


1305 


13224 


RTA00002935F.f.15.1.P.Seq


F 


M00054693A:E11 


CH17COHLV


1306 


14883 


RTA00002930F.k. 14. l.P.Seq 


F 


M00056320B:A03 


CH15CON


1307 


13363 


RTA00002935F.a.02.1.P.Seq


F 


M00042352D:B03 


CH17COHLV


1308 


16869 


RTA00002889F.c.21.1.P.Seq


F 


M00001533C:G11


CH01COH 


1309 


16 


RTA00002935F.a.06. l.P.Seq 


F 


M00042449B:F05 


CH17COHLV


1310 


4359 


RTA00002903F.j.16.1.P.Seq


F 


M00006994C:F06


CH02COH


1311 


20726 


RTA00002908F.a.17.1.P.Seq


F 


M00022363C:D05


CH03MAH 


1312 


13713 


RTA00002924F.l.09.1.P.Seq


F 


M000396S6C:C01 


CH09LNL 


1313 


29271 


RT A00002935F.d.20. 1 .P.Seq 


F 


M0005454SCH06 


CH17COHLV


1314 


6237 


RTA00002935F.f.14.1.P.Seq


F 


M00054686A:F10 


CH17COHLV


1315 


3472 


RTA00002922F.n. 12. l.P.Seq 


F 


M00039134D:F08


CH09LNL 


1316 


186798 


RTA00002911F.f.11.1.P.Seq


F 


M00026914C:H09 


CH04MAL 


1317 


13193 


RTA00002886F.l.16.1.P.Seq


F 


M00001369A:G06 


CH01COH 


1318 


3554 


RTA00002919F.i.14.1.P.Seq


F 


M00033149B:E10 


CH08LNH


1319 


19991 


RTA00002908F.h. 11.1 .P.Seq 


F 


M00022446C:H06


CH03MAH 


1320 


173046 


RTA00002901F.o.19.1.P.Seq


F 


M00005703D:G10 


CH02COH 


1321 


21798 


RTA00002932F.b. 12. l.P.Seq 


F 


M00043016B:F09


CH18CON


1322 


11303 


RTAO0O02898F.L 11.1 .P.Seq 


F 


M00004359A:E01 


CH01COH 


1323 


4026 


RTA00002915F.m.02.2.P.Seq 


F 


M00032494C:H08 


CH08LNH 


1324 


94859 


RTA00002909F.i.23.1.P.Seq 


F 


M00022656D:D07 


CH03MAH


1325 


12315 


RTA00002907F.m.01.1.P.Seq


F 


M00022240D:B11


CH03MAH


1326 


4822 


RTA00002909F.l.16.1.P.Seq


F 


M00022690A:A07 


CH03MAH 


1327 


97129 


RTA00002909F.l.13.1.P.Seq


F 


M00022684A:E06


CH03MAH


1328 


15996 


RTA00002897F.l.09.1.P.Seq


F 


M000042S1A:C04 


CH01COH 


1329 


7209 


RTA00002918F.c.01.1.P.Seq


F 


M00032835D:G04


CH08LNH 


1330 


111888 


RTA00002902F.h.08.1.P.Seq


F 


M0000667SC:C02 


CH02COH 


1331 


15642 


RTA00002902F.g.06.1.P.Seq


F 


M00006646A:A07 


CH02COH 


1332 


20016 


RTA000029 16F.f.05. l.P.Seq 


F 


M0003256"B:G05 


CH08LNH


1333 


21603 


RTA00002902F.a.05.1.P.Seq 


F 


M0000576?D:A01 


CH02COH 


1334 


156903 


RT A00002907 F. i .09 .2 .P. Seq 


F 


M00022200B:B05


CH03MAH 


1335 


1425 


RTA000029 16F.b. 19. l.P.Seq 


F 


M00032541C:G03 


CH08LNH 


1336 


186061 


RTA000029 1 1 F.e.24. 1 .P.Seq 


F 


M00026900A:H07 


CH04MAL


1337 


20717 


RTA00002907F.o.19.1.P.Seq


F 


M000222*3A:E03 


CH03MAH 


1338 


12586 


RTA00002887F.a.09.1.P.Seq


F 


M00001385A:E07


CH01COH 


1339 


19719 


RTA00002914F.h.23. l.P.Seq 


F 


M00028212D:C05


CH08LNH


1340 


474 


RTA00002917F.g.15.1.P.Seq


F 


M00032727A:E04 


CH08LNH


1341 


11907 


RTA00002923F.O.07. 1 .P.Seq 


F 


M000393SIC:C07 


CH09LNL 


1342 


6806 


RTAO0002928F.d.O2. 1 .P.Seq 


F 


M00040169A:G06 


CH13EDT 


1343 


13146 


RTAO00O2892F.r\ 10.2.P.Seq 


F 


M0O0O3SUA:GO5 


CH01COH 


1344 


16686 


RTAO00O29 i9F.f. 14. l.P.Seq 


F 


M000330T2A:A09 


CHOSLNH 


1345 


6823 


RTA0000288SF.a.04. 1 .P.Seq 


F 


M00001433B:E02 


CHOICOH 


1346 


43029 


RTAO0002897F.d.03. 1 .P.Seq 


F 


M000O4Z:5D:EO3 


CHOICOH 


1347 


147S9 


RTA00002935F.k. 1 1. 1 P.Seq 


F 


M00055055CF01 


CH17C0HLV 


1348 


186061 


RTA0000291 1 F.t'.Ol. l.P.Seq 


F 


M00026900A.H07 


CH04M.AL 


1349 


12823 


RTA0000292 IF.g. 24. 1 .P.Seq 


F 


M00033434D:FO5 


CH09LNL 


1350 


25844 


RTA0000290SF.k.23.l.P.Seq 


F 


M000224"-iB:C08 


CH03MAH 



SEQ ID | CLUSTER | SEQ NAME | ORIENTATION | CLONE ID | LIBRARY


1351 | 47793 | RTA00002930F.2.03.2.P.Seq | F | M00055S09A:B09 | CH15CON
1352 | 7695 | RTA00002892F.n.18.2.P.Seq | F | M00003845A:C07 | CH01COH
1353 | 16997 | RTA00002922F.k.15.1.P.Seq | F | M0003910"A:E12 | CH09LNL
1354 | 25441 | RTA00002906F.L08.1.P.Seq | F | M000219S1A:C02 | CH03MAH
1355 | 4303 | RTA00002897F.a.20.1.P.Seq | F | M00004295D:C07 | CH01COH
1356 | 5741 | RTA00002887F.c.19.1.P.Seq | F | M00001390D:E02 | CH01COH
1357 | 17264 | RTA00002900F.a.18.1.P.Seq | F | M00004831C:G11 | CH02COH
1358 | 11766 | RTA00002925F.f.20.1.P.Seq | F | M00039871C:G05 | CH09LNL
1359 | 136L8 | RTA00002893F.o.15.1.P.Seq | F | M00003963D:F01 | CH01COH
1360 | 13903 | RTA00002923F.c.18.1.P.Seq | F | M00039204A:E09 | CH09LNL
1361 | 10673 | RTA00002927F.h.23.1.P.Seq | F | M000396-i6A:E06 | CH12EDT
1362 | 17412 | RTA00002932F.b.11.1.P.Seq | F | M00043015D:D05 | CH18CON
1363 | 2218 | RTA00002919F.a.20.1.P.Seq | F | M0003302SC:A02 | CH08LNH
1364 | 5858 | RTA00002923F.L01.1.P.Seq | F | M00039275B:E02 | CH09LNL
1365 | 2510 | RTA00002898F.b.14.1.P.Seq | F | M00004316A:B03 | CH01COH
1366 | 8050 | RTA00002900F.n.04.1.P.Seq | F | M000053S3A:C11 | CH02COH
1367 | 186538 | RTA00002929F.e.18.1.P.Seq | F | M000403Z9A:H05 | CH14EDT
1368 | 25427 | RTA00002935F.n.20.1.P.Seq | F | M00055337B:C04 | CH17COHLV
1369 | 24098 | RTA00002901F.a.10.1.P.Seq | F | M000054ZZD:H02 | CH02COH
1370 | 123823 | RTA00002905F.h.08.1.P.Seq | F | M00008071D:H03 | CH03MAH
1371 | 3644 | RTA00002901F.c.03.1.P.Seq | F | M000054-ifD:D04 | CH02COH
1372 | 27783 | RTA00002917F.a.17.1.P.Seq | F | M00032666A:C02 | CH08LNH
1373 | 1682 | RTA00002910F.b.03.1.P.Seq | F | M00022801D:D09 | CH03MAH
1374 | 3200 | RTA00002887F.e.07.1.P.Seq | F | M00001393C:F04 | CH01COH
1375 | 8442 | RTA00002917F.fi.23.1.P.Seq | F | M0003273-B:E12 | CH08LNH
1376 | 15353 | RTA00002910F.e.11.1.P.Seq | F | M00022Sf-C:G07 | CH03MAH
1377 | 6314 | RTA00002922F.b.06.1.P.Seq | F | M0003861SD:D08 | CH09LNL
1378 | 93549 | RTA00002909F.j.14.1.P.Seq | F | M0002266ZC:H04 | CH03MAH
1379 | 15496 | RTA00002906F.p.03.1.P.Seq | F | M000220S53:H02 | CH03MAH
1380 | 16572 | RTA00002886F.k.03.1.P.Seq | F | M0000136-A:C09 | CH01COH
1381 | 74821 | RTA00002890F.p.21.1.P.Seq | F | M00001663A:A12 | CH01COH
1382 | 11315 | RTA00002889F.d.12.1.P.Seq | F | M000015353:B10 | CH01COH
1383 | 10859 | RTA00002894F.c.1S.1.P.Seq | F | M00003950D:C06 | CH01COH
1384 | 15391 | RTA00002914F.f.04.1.P.Seq | F | M000281933:E07 | CH08LNH
1385 | 23172 | RTA00002896F.b.18.1.P.Seq | F | M0000414iB:F08 | CH01COH
1386 | 22510 | RTA00002886F.l.05.1.P.Seq | F | M0000136SA:C02 | CH01COH
1387 | 17156 | RTA00002934F.a.0S.1.P.Seq | F | M0004345f3:C08 | CH20COHLV
1388 | 4593 | RTA00002896F.o.1S.1.P.Seq | F | M0000420CC:A04 | CH01COH
1389 | 2178 | RTA00002901F.m.0S.1.P.Seq | F | M000056ZcD:G11 | CH02COH
1390 | 1015 | RTA00002933F.c.11.1.P.Seq | F | M000432i3A:D05 | CH19COP
1391 | 26792 | RTA00002907F.a.1S.1.P.Seq | F | M0002210:-C:D05 | CH03MAH
1392 | 27830 | RTA00002921F.c.07.1.P.Seq | F | M0003334-iA:B06 | CH09LNL
1393 | 14648 | RTA00002898F.j.11.1.P.Seq | F | M0000436:C:G11 | CH01COH
1394 | 12585 | RTA00002897F.i.20.1.P.Seq | F | M0000426^A:F11 | CH01COH
1395 | 15825 | RTA00002916F.d.12.1.P.Seq | F | M00032553A:A07 | CH08LNH
1396 | 7043 | RTA00002900F.h.07.1.P.Seq | F | M000050l-3:F02 | CH02COH
1397 | 29354 | RTA00002905F.c.13.1.P.Seq | F | M000079S:C:F07 | CH03MAH
1398 | 29703 | RTA00002907F.d.24.1.P.Seq | F | M0002214-C:E12 | CH03MAH
1399 | 6311 | RTA00002913F.b.0~.1.P.Seq | F | M0002772^D:D04 | CH04MAL
1400 | 12657 | RTA00002906F.b.20.1.P.Seq | F | M0002186cC:H0S | CH03MAH





1401 | 2033 | RTA00002922F.e.08.2.P.Seq | F | M00039024D:E12 | CH09LNL
1402 | 24229 | RTA00002920F.b.04.1.P.Seq | F | M00033329C:C02 | CH08LNH
1403 | 20664 | RTA00002886F.a.07.1.P.Seq | F | M0000133SC:F05 | CH01COH
1404 | 3656 | RTA00002902F.f.20.1.P.Seq | F | M00006641B:F05 | CH02COH
1405 | 10998 | RTA00002931F.c.07.1.P.Seq | F | M0004287SD:G06 | CH16COP
1406 | 1150 | RTA00002922F.j.14.1.P.Seq | F | M00039081B:G07 | CH09LNL
1407 | 45221 | RTA00002900F.h.06.1.P.Seq | F | M00005013D:H05 | CH02COH
1408 | 34505 | RTA00002901F.a.16.1.P.Seq | F | M00005423OA10 | CH02COH
1409 | 8175 | RTA00002924F.L0L.1.P.Seq | F | M00039472B:E05 | CH09LNL
1410 | 8175 | RTA00002924F.e.24.1.P.Seq | F | M00039472B:E05 | CH09LNL
1411 | 19375 | RTA00002903F.n.02.1.P.Seq | F | M00007081B:C08 | CH02COH
1412 | 10866 | RTA00002929F.c.15.1.P.Seq | F | M00040219B:B07 | CH14EDT
1413 | 24166 | RTA00002891F.k.07.1.P.Seq | F | M00003763A:B02 | CH01COH
1414 | 15333 | RTA00002888F.c.12.1.P.Seq | F | M00001442C:G12 | CH01COH
1415 | 44436 | RTA00002907F.b.17.1.P.Seq | F | M00022117C:A02 | CH03MAH
1416 | 9247 | RTA00002930F.a.16.1.P.Seq | F | M00042560C:G06 | CH15CON
1417 | 12317 | RTA00002908F.2.13.1.P.Seq | F | M00022430C:C06 | CH03MAH
1418 | 11968 | RTA00002890F.L24.1.P.Seq | F | M00001625D:B04 | CH01COH
1419 | 14181 | RTA00002908F.n.09.2.P.Seq | F | M00022499D:D08 | CH03MAH
1420 | 15359 | RTA00002909F.L02.1.P.Seq | F | M00022677C:C01 | CH03MAH
1421 | 46675 | RTA00002916F.h.03.1.P.Seq | F | M000325S-iA:D06 | CH08LNH
1422 | 24898 | RTA00002903F.k.17.1.P.Seq | F | M00007019B:E01 | CH02COH
1423 | 156424 | RTA00002905F.m.22.1.P.Seq | F | M00021653A:B02 | CH03MAH
1424 | 11996 | RTA00002901F.b.24.1.P.Seq | F | M00005445A:E07 | CH02COH
1425 | 11996 | RTA00002901F.c.01.1.P.Seq | F | M00005445A:E07 | CH02COH
1426 | 4784 | RTA00002894F.e.20.1.P.Seq | F | M0000398SD:B01 | CH01COH
1427 | 9120 | RTA00002914F.h.10.1.P.Seq | F | M00028210B:H03 | CH08LNH
1428 | 11295 | RTA00002890F.j.15.1.P.Seq | F | M0000163:C:A10 | CH01COH
1429 | 3991 | RTA00002896F.h.05.1.P.Seq | F | M00004162D:F02 | CH01COH
1430 | 20358 | RTA0000290SF.b.06.1.P.Seq | F | M0002236~D:G11 | CH03MAH
1431 | 12823 | RTA00002921F.h.01.1.P.Seq | F | M00033434D:F05 | CH09LNL
1432 | 147419 | RTA00002906F.S.05.1.P.Seq | F | M00021952B:G06 | CH03MAH
1433 | 12174 | RTA00002919F.f.13.1.P.Seq | F | M00033071D:E08 | CH08LNH
1434 | 35608 | RTA00002897F.o.24.1.P.Seq | F | M00004296B:D03 | CH01COH
1435 | 2325 | RTA00002894F.g.07.1.P.Seq | F | M00003994A:B10 | CH01COH
1436 | 166261 | RTA00002908F.I.05.1.P.Seq | F | M00022475D:C07 | CH03MAH
1437 | 5713 | RTA00002920F.a.09.1.P.Seq | F | M00033324B:F04 | CH08LNH
1438 | 3624 | RTA00002910F.g.06.1.P.Seq | F | M00022901A:C05 | CH03MAH
1439 | 10305 | RTA00002909F.a.07.1.P.Seq | F | M00022530B:C04 | CH03MAH
1440 | 7768 | RTA00002910F.k.22.1.P.Seq | F | M00022992B:G12 | CH03MAH
1441 | 9847 | RTA00002908F.p.07.1.P.Seq | F | M00022516B:C05 | CH03MAH
1442 | 8583 | RTA00002887F.o.06.1.P.Seq | F | M00001426OF06 | CH01COH
1443 | 24376 | RTA00002900F.b.07.1.P.Seq | F | M00004S3cB:C02 | CH02COH
1444 | 8743 | RTA00002907F.n.19.1.P.Seq | F | M0002226ZA:F06 | CH03MAH
1445 | 22251 | RTA00002926F.c.10.2.P.Seq | F | M00040079B:F06 | CH09LNL
1446 | 12337 | RTA00002928F.d.07.1.P.Seq | F | M00040173D:A04 | CH13EDT
1447 | 13623 | RTA00002911F.d.08.2.P.Seq | F | M00026S42B:A01 | CH04MAL
1448 | 5521 | RTA00002887F.j.06.1.P.Seq | F | M00001406B:H09 | CH01COH
1449 | 2193 | RTA00002933F.a.13.1.P.Seq | F | M0004307"B:F11 | CH19COP
1450 | 773 | RTA00002889F.j.02.1.P.Seq | F | M00001551D:H09 | CH01COH


1451 | 142367 | RTA00002927F.h.11.1.P.Seq | F | M00039630D:B07 | CH12EDT
1452 | 19284 | RTA00002889F.e.10.1.P.Seq | F | M00001537B:H10 | CH01COH
1453 | 24011 | RTA00002924F.c.17.1.P.Seq | F | M00039440C:G06 | CH09LNL
1454 | 5930 | RTA00002911F.t.08.1.P.Seq | F | M00026910B:G06 | CH04MAL
1455 | 21581 | RTA00002902F.c.05.1.P.Seq | F | M00005S22C:A04 | CH02COH
1456 | 3662 | RTA00002925F.c.07.1.P.Seq | F | M00039826D:E04 | CH09LNL
1457 | 4873 | RTA00002930F.b.05.1.P.Seq | F | M00042719A:G08 | CH15CON
1458 | 11214 | RTA00002896F.h.01.1.P.Seq | F | M00004161A:E08 | CH01COH
1459 | 22888 | RTA00002892F.I.09.1.P.Seq | F | M00003837C:D10 | CH01COH
1460 | 15490 | RTA00002925F.k.08.1.P.Seq | F | M00039932B:A07 | CH09LNL
1461 | 112819 | RTA00002905F.o.13.1.P.Seq | F | M00021676C:G03 | CH03MAH
1462 | 19688 | RTA00002896F.I.02.1.P.Seq | F | M00004179D:A12 | CH01COH
1463 | 15132 | RTA00002922F.n.20.1.P.Seq | F | M0003913SB:G05 | CH09LNL
1464 | 25022 | RTA00002914F.L2 Ll.P.Seq | F | M00028219B:H05 | CH08LNH
1465 | 16303 | RTA00002888F.b.12.1.P.Seq | F | M0000143SA:E01 | CH01COH
1466 | 16828 | RTA00002897F.b.04.1.P.Seq | F | M00004214A:E05 | CH01COH
1467 | 14295 | RTA00002921F.a.18.1.P.Seq | F | M00033296C:C11 | CH09LNL
1468 | 1979 | RTA00002930F.f.06.1.P.Seq | F | M00055725D:D09 | CH15CON
1469 | 36248 | RTA00002888F.2.05.1.P.Seq | F | M00001460C:E10 | CH01COH
1470 | 5676 | RTA00002926F.b.22.2.P.Seq | F | M000400753:A05 | CH09LNL
1471 | P39 | RTA00002887F.o.21.1.P.Seq | F | M0000142SB:C10 | CH01COH
1472 | 7937 | RTA00002917F.g.22.1.P.Seq | F | M0003272SD:F01 | CH08LNH
1473 | 4483 | RTA00002911F.d.22.2.P.Seq | F | M000268563:G03 | CH04MAL
1474 | 7796 | RTA00002925F.c.05.1.P.Seq | F | M00039826B:F09 | CH09LNL
1475 | 17330 | RTA00002915F.a.03.1.P.Seq | F | M00028616C:D09 | CH08LNH
1476 | 25620 | RTA00002902F.f.09.1.P.Seq | F | M00006631C:A04 | CH02COH
1477 | 20601 | RTA00002923F.l.20.1.P.Seq | F | M00039326A:G07 | CH09LNL
1478 | 6205 | RTA00002923F.g.21.1.P.Seq | F | M00039255C:C01 | CH09LNL
1479 | 726 | RTA00002913F.b.16.1.P.Seq | F | M000°773-D-C03 | CH04MAL
1480 | 104999 | RTA00002908F.S.17.1.P.Seq | F | M000224353:G12 | CH03MAH
1481 | 303^1 | RTA00002919F.o.17.1.P.Seq | F | M0003i26-^B:E06 | CH08LNH
1482 | 5878 | RTA00002913F.a.16.1.P.Seq | F | M000276SSC:C01 | CH04MAL
1483 | 5944 | RTA00002905F.m.07.1.P.Seq | F | M00021649B:A02 | CH03MAH
1484 | 5796 | RTA00002908F.i.21.1.P.Seq | F | M0002245"A:G05 | CH03MAH
1485 | 3804 | RTA00002935F.m.24.1.P.Seq | F | M0005525-iA:H03 | CH17COHLV
1486 | 2728 | RTA0000291SF.a.22.1.P.Seq | F | M00032S2SA:A06 | CH08LNH
1487 | 3804 | RTA00002935F.n.01.1.P.Seq | F | M000552^A:H03 | CH17COHLV
1488 | 3932 | RTA00002915F.o.19.2.P.Seq | F | M0003251"C:E10 | CH08LNH
1489 | 16691 | RTA00002891F.o.03.1.P.Seq | F | MOOOOvSC A GOl | CH01COH
1490 | 15430 | RTA00002900F.o.10.1.P.Seq | F | MOOOO'iO0 :: D'CO2 | CH02COH
1491 | 5637 | RTA00002925F.b.1S.1.P.Seq | F | M00039S20B:F06 | CH09LNL
1492 | 16633 | RTA00002897F.2.15.1.P.Seq | F | M0000424c3:H07 | CH01COH
1493 | 21826 | RTA00002898F.S.06.1.P.Seq | F | M0000434-A:G11 | CH01COH
1494 | 22193 | RTA00002919F.L09.1.P.Seq | F | M00033UcD:A03 | CH08LNH
1495 | 10720 | RTA0000289SF.c.14.1.P.Seq | F | M000043:CC:E07 | CH01COH
1496 | 22491 | RTA00002925F.m.06.1.P.Seq | F | M000a0003A:G10 | CH09LNL
1497 | 10423 | RTA00002915F.n.13.2.P.Seq | F | M0003:50"D:G0S | CH08LNH
1498 | 4953 | RTA00002916F.h.11.1.P.Seq | F | M000325$eC:B04 | CH08LNH
1499 | 185567 | RTA00002911F.p.0S.1.P.Seq | F | M0002717S3:A11 | CH04MAL
1500 | 25605 | RTA00002924F.m.22.1.P.Seq | F | M000397lC3:A01 | CH09LNL





1501 | 29446 | RTA00002906F.m.24.1.P.Seq | F | M00022070B:B04 | CH03MAH
1502 | 9668 | RTA00002908F.g.02.1.P.Seq | F | M00022421A:F12 | CH03MAH
1503 | 29446 | RTA00002906F.n.01.1.P.Seq | F | M00022070B:B04 | CH03MAH
1504 | 7171 | RTA00002887F.m.22.1.P.Seq | F | M00001421B:E07 | CH01COH
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Table 3 





Columns 2-4: Nearest Neighbor (BlastN vs. Genbank). Columns 5-7: Nearest Neighbor (BlastX vs. Non-Redundant Proteins).
SEQ ID | ACCESSION | DESCRIPTION | P VALUE | ACCESSION | DESCRIPTION | P VALUE


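1 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
2 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
3 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
4 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
5 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
6 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
7 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
8 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
9 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
10 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
11 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>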








12 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
13 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
14 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
15 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
16 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
17 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
18 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
19 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
20 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
21 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>
22 | <NONE> | <NONE> | <NONE> | <NONE> | <NONE> | <NONE>






23 | <NONE> | <NONE> | <NONE> | 54S562 | GENOME POLYPROTEIN [CONTAINS: RNA REPLICASE; HELICASE; COAT PROTEIN] (2.7.7.48) - apple stem grooving virus (strain P-209) | 9 :
24 | <NONE> | <NONE> | <NONE> | 416959 | EXCISION REPAIR PROTEIN ERCC-6 DNA repair helicase ERCC6 - human >gi|1821Sl (L04791) excision repair protein [Homo sapiens] | 8.9
25 | <NONE> | <NONE> | <NONE> | J J - / L/VD | (AB014541) KIAA0641 protein [Homo sapiens] | o - o . .
26 | <NONE> | <NONE> | <NONE> | 861293 | Kj-oI'+l) rJJU-.l gene product [Caenorhabditis elegans] | 7.9
27 | <NONE> | <NONE> | <NONE> | 3297821 | protein | 5.5
28 | <NONE> | <NONE> | <NONE> | 2119692 | transforming growth factor-beta type III receptor - chicken >gi|511843 (L01121) transforming growth factor-beta type III receptor [Gallus gallus] | 5.1
29 | <NONE> | <NONE> | <NONE> | 213602S | protein kinase PRK1 - human | 5.0
















30 | <NONE> | <NONE> | <NONE> | 2746912 | (AF040659) No definition line found [Caenorhabditis elegans] | 4.6
31 | <NONE> | <NONE> | <NONE> | 2358287 | (AF010404) ALR [Homo sapiens] | 4.5
32 | <NONE> | <NONE> | <NONE> | 3877816 | (Z96048) predicted using Genefinder; cDNA EST EMBL:D65516 comes from this gene; cDNA EST yk191a5.5 comes from this gene [Caenorhabditis elegans] | 4.4
33 | <NONE> | <NONE> | <NONE> | 4140268 | (Y14953) SRCR domain, membrane form 2 | 4.1
34 | <NONE> | <NONE> | <NONE> | 1708663 | (U51183) transposase [Hydra vulgaris] | 4.0
35 | <NONE> | <NONE> | <NONE> | 1184100 | (U45958) pistil extensin-like protein [Nicotiana alata] | 3.9
36 | <NONE> | <NONE> | <NONE> | 121073 | GLUCOCORTICOID RECEPTOR (GR) | 3.9
37 | <NONE> | <NONE> | <NONE> | 1718298 | (U75698) ORF 45; contains an extended acidic domain; EBV BKRF4 homolog [Kaposi's sarcoma-associated herpesvirus] homolog, conserved in other gamma-herpesviruses | 2.6
38 | <NONE> | <NONE> | <NONE> | 2352538 | (AF006564) alcohol dehydrogenase [Drosophila persimilis] | 1.4
39 | <NONE> | <NONE> | <NONE> | 3192897 | (AF066071) SP85; PsB [Dictyostelium discoideum] | 1.4
40 | <NONE> | <NONE> | <NONE> | 561645 | (L33421) This CDS feature is included to show the translation of the corresponding V_region. Presently translation qualifiers on V_region features are illegal | 1.0
41 | <NONE> | <NONE> | <NONE> | 3878S57 1 | {jC6jiid) predicted using Genefinder; cDNA EST EMBL:D35016 comes from this gene; cDNA EST EMBL:D32583 comes from this gene; cDNA EST EMBL:D35258 comes from this gene; cDNA EST EMBL:C11471 comes from this gene; cDNA EST EMBL:C... | 1.0












42 | <NONE> | <NONE> | <NONE> | 1658571 | (U75903) UGT1A7 [Rattus norvegicus] | 1.0
43 | <NONE> | <NONE> | <NONE> | 2338034 | (AF005370) putative immediate early protein [Alcelaphine herpesvirus 1] | 0.86
44 | <NONE> | <NONE> | <NONE> | 3043714 | (AJJOI 1 lo/) tsJ aaujVj protein [Homo sapiens] | 0.42
45 | <NONE> | <NONE> | <NONE> | 1723710 | HYPOTHETICAL 92.7 KD PROTEIN IN ASN2-PHB1 INTERGENIC REGION >gi|2131678|pir||S64439 hypothetical protein YGR130c - yeast (Saccharomyces cerevisiae) >gi|1323215|gnl|PID|e243523 (Z72915) ORF YGR130c [Saccharomyces cerevisiae] | 0.40
46 | <NONE> | <NONE> | <NONE> | 1723710 | HYPOTHETICAL 92.7 KD PROTEIN IN ASN2-PHB1 >gi|2131678|pir||S64439 hypothetical protein YGR130c - yeast (Saccharomyces cerevisiae) >gi|1323215|gnl|PID|e243523 (Z72915) ORF YGR130c [Saccharomyces cerevisiae] | 0.38
47 | <NONE> | <NONE> | <NONE> | 2996117 | (AF046125) immediate early 2 [Rat cytomegalovirus] | 0.26
48 | <NONE> | <NONE> | <NONE> | 4151809 | (AF102853) synaptic SAPAP-interacting protein Synamon | 0.024
49 | <NONE> | <NONE> | <NONE> | 2773341 | (AF040954) putative protein phosphatase 1 nuclear targeting subunit [Rattus norvegicus] | 0.017
50 | <NONE> | <NONE> | <NONE> | 1653522 | (D90914) hypothetical protein | 3e-04
51 | <NONE> | <NONE> | <NONE> | 3219965 | HYPOTHETICAL 100.6 KD TRP-ASP REPEATS CONTAINING PROTEIN C2C6.04C IN CHROMOSOME I | 3e-06
52 | <NONE> | <NONE> | <NONE> | 4185567 | (AF115480) cAMP-dependent Rap1 guanine-nucleotide exchange factor [Mus musculus] | 7e-07












53 | <NONE> | <NONE> | <NONE> | 1176527 | HYPOTHETICAL 45.5 KD CHROMOSOME III >gi|500724 (U10402) C34E10.1 gene product [Caenorhabditis elegans] | 3e-20
54 | X35444 | G.pallida repetitive DNA element | 5.0 | 2118936 | beta-globin - chimpanzee (fragment) | 8.6
55 | X72961 | Synechococcus sp. cpeB, cpeA genes and ORF3 | 5.0 | 462569 | MICROTUBULE-ASSOCIATED PROTEIN 1A microtubule-associated protein MAP1A - rat >gi|205538 norvegicus] | 2.2
56 | U94747 | Human WD repeat protein HAN11 mRNA, complete cds | 5.0 | 3875538 | (Z67990) similar to cuticle collagen | 1.3
57 | AF032108 | Homo sapiens integrin alpha-7 mRNA, complete cds | 5.0 | 2147194 | collagen - Paralvinella grasslei | 0.002
58 | Z50798 | G.gallus mRNA for p52 | 5.0 | 3122885 | ASPARTYL-TRNA SYNTHETASE synthetase [Bacillus subtilis] | 3e-11
59 | AB002384 | Human mRNA for KIAA0386 gene, complete cds | 5.0 | 2632098 | (Y15513) Prodos protein [Drosophila melanogaster] | 9e-12
60 | X14835 | Thermofilum pendens DNA for 16S and 23S ribosomal RNA, tRNA-Met, and tRNA Gly | 4.9 | <NONE> | <NONE> | <NONE>
61 | U87149 | Hordeum vulgare nucellin gene, complete cds | 4.9 | 128578 | NONSTRUCTURAL PROTEIN NS-S spotted wilt virus (strain CPNH1) non-structural protein [Tomato spotted wilt virus] | 2.8
62 | D87541 | Mus musculus gene for integrin alpha v subunit, promoter region | 4.9 | J 36956 | HYPOTHETICAL PROTEIN UL61 cytomegalovirus (strain AD169) cytomegalovirus] | 0.038
63 | U72520 | Mus musculus mena protein (Mena) mRNA, complete cds | 4.9 | 3413892 | (AB007934) KIAA0465 protein [Homo sapiens] | 6e-07
















64 | S79797 | enzymatic glycosylation-regulating gene [rats, Sprague-Dawley, streptozotocin diabetic, heart, mRNA, 5010 nt] | 4.8 | <NONE> | <NONE> | <NONE>
65 | AB011102 | Homo sapiens mRNA for KIAA0530 protein, partial cds | 4.8 | 138022 | RECEPTOR RECOGNIZING PROTEIN gp38 - phage Ox2 >gi|15126 (X05675) gene 38 (AA 1-266); pid:g15126 [Bacteriophage Ox2] | 3.6
66 | AF100985 | Penaeus monodon phosphopyruvate hydratase mRNA, complete cds | 4.8 | 500615 | (D16221) endochitinase [Oryza sativa] | 2.8
67 | U31756 | Bacillus subtilis gamma-aminobutyrate permease cds | 4.8 | 3880699 | (AL021471) similar to Eukaryotic aspartyl proteases [Caenorhabditis elegans] | 2.8
68 | U25111 | Pisum sativum chloroplast processing enzyme mRNA, nuclear gene encoding chloroplast protein, complete cds. | 4.8 | 1800145 | (U83658) FH1/FH2 protein homolog [Emericella nidulans] | 1.6
69 | U00454 | Mus musculus Cdx-2 homeobox protein gene, complete cds. | 4.7 | <NONE> | <NONE> | <NONE>
70 | M84166 | Hamster c-Ha-ras protein gene, complete cds. | 4.7 | 1710606 | RENIN-BINDING PROTEIN (RNBP) protein [Rattus norvegicus] | 0.88
71 | AF087516 | Mus musculus major sperm fibrous sheath protein Pro-mAKAP82 gene, alternative splice exons 1* and 1" | 4.6 | <NONE> | <NONE> | <NONE>
72 | X74160 | M.esculenta mRNA for granule-bound starch synthase | 4.6 | <NONE> | <NONE> | <NONE>
73 | M97487 | Haloferax volcanii superoxide dismutase (sod2) gene, complete cds. | 4.6 | 2623307 | (AC002409) putative ubiquitin protease [Arabidopsis thaliana] | 3.4
















74 |  | melanogaster suppressor of sable gene, complete cds. |  | <NONE> | <NONE> | <NONE>
75 | D49708 | Rattus norvegicus mRNA for RNA binding protein | 4.5 | <NONE> | <NONE> | <NONE>
76 |  | glycin-threonin/serine repeat protein, complete cds |  | ■£447193 | (U42580) NKl'I'F (7x), DETTS (4x) [Paramecium bursaria Chlorella virus 1] | 3.3
77 | Z47036 | Human partial cDNA sequence, clone bs6I3; | 2.9 | <NONE> | <NONE> | <NONE>
78 | L19660 | Rattus norvegicus gastric inhibitory peptide receptor mRNA, complete cds | 2.7 | 2358279 | (AF007871) torsinA [Homo sapiens] | 2e-07
79 | XS2841 | A.thaliana Aco gene | 2.6 | 48^212 | immediate-early protein IE110 - human herpesvirus 1 (strain HFEM) (fragment) | 8.4
80 | X61931 | S.purpurascens famA and famB genes for FAS domain and acyl-CoA-dehydrogenases, respectively | 2.6 | 2290534 | (U95031) sublingual gland mucin [Homo sapiens] | 0.47
81 | U13680 | Human lactate dehydrogenase-C (LDH-C) mRNA, complete cds. | 2.5 | 2887449 | (AB007874) KIAA0414 [Homo sapiens] | 3.1
82 | AB007869 | Homo sapiens KIAA0409 mRNA, partial cds | 2.4 | 3130157 | (AB008859) pheromone receptor [Fugu rubripes] | 5.4
83 | X97479 | H.sapiens mas proto-oncogene, 5' region | 2.1 | <NONE> | <NONE> | <NONE>
84 | X98374 | R.norvegicus mRNA for KIS protein | 1.9 | <NONE> | <NONE> | <NONE>
85 | AE000710 | Aquifex aeolicus section 42 of 109 of the complete genome | 1.9 | <NONE> | <NONE> | <NONE>






86 | D30612 | Homo sapiens mRNA for repressor protein, partial cds | 1.9 | <NONE> | <NONE> | <NONE>
87 | Y14321 | Homo sapiens PMP69 gene, exons 8,9,10 & 11 | 1.9 | <NONE> | <NONE> | <NONE>
88 | D90773 | E.coli genomic DNA, Kohara clone #262(30.3-30.5 min.) | 1.9 | 1536816 | (D78305) DNA binding protein [Chlorella virus] | 7.9
89 | AE000991 | Archaeoglobus fulgidus section 116 of 172 of the complete genome | 1.9 | 520645 | (X79095) pyruvate,orthophosphate dikinase [Flaveria trinervia] | 2.7
90 | U39476 | Rattus norvegicus p95 Vav (Vav) proto-oncogene mRNA, complete cds. | 1.9 | 4158178 | (AL023496) hypothetical protein | 1.6
91 | U28S38 | Human transcription factor TFIIIB 90 kDa subunit | 1.9 | 2495730 | HYPOTHETICAL PROLINE-RICH PROTEIN KIAA0269 >gi|1665805|gnl|PID|d1014089 (D87459) Similar to Volvox carteri extensin (S2269/) [Homo sapiens] | 0.23
92 | U20106 | Rattus norvegicus synaptotagmin VII mRNA, complete cds. | 1.9 | 478380 | UL47h protein - Marek's disease virus | 0.23
93 | AF071010 | Mouse mammary tumor virus putative integrase, env polyprotein, and superantigen mRNA, complete cds | 1.9 | 2781386 | (AC004010) similar to Leucine-rich transmembrane proteins; 44% similarity to U42767 (PID:g1736918) [Homo sapiens] | 4e-33
94 | AF061881 | Mesocricetus auratus c-fos proto-oncogene protein (c-fos) gene, complete cds | 1.8 | <NONE> | <NONE> | <NONE>
95 | AE001397 | Plasmodium falciparum chromosome 2, section 34 of 73 of the complete sequence | 1.8 | <NONE> | <NONE> | <NONE>






96 | D14701 | Horseshoe crab mRNA for coagulation factor B, complete cds | 1.8 | <NONE> | <NONE> | <NONE>
97 | M29154 | P.falciparum multidrug resistance (MDR) gene, complete cds. | 1.8 | <NONE> | <NONE> | <NONE>
98 | L16532 | Rattus norvegicus (clone pCNPII) 2',3'-cyclic nucleotide 3'-phosphodiesterase (CNPII) mRNA, complete cds. | 1.8 | <NONE> | <NONE> | <NONE>
99 | AE001434 | Plasmodium falciparum chromosome 2, section 71 of 73 of the complete sequence | 1.8 | <NONE> | <NONE> | <NONE>
100 | Z46785 | D.melanogaster gene for protamine (mst35Bb). | 1.8 | <NONE> | <NONE> | <NONE>
101 | X69822 | P.sylvestris mRNA for glutamine synthetase | 1.8 | 219896 | (D90452) l-caldesmon I [Homo sapiens] | 9.7
102 | U49055 | Rattus norvegicus CTD-binding SR-like protein rA8 mRNA, complete cds | 1.8 | 2497252 | INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 4 (IGFBP-4) (IBP-4) (IGF-BINDING PROTEIN 4) factor-binding protein-4 - sheep (fragment) factor-binding protein-4, IGFBP-4 [sheep, liver, Peptide, 237 aa] [Ovis aries] | 2.5
103 | L28101 | Homo sapiens kallistatin (PI4) gene, exons 1-4, complete cds | 1.8 | 4204267 | (AC005223) 55585 [Arabidopsis thaliana] | 2.4
104 | U66987 | Pandorina morum internal transcribed spacer 1, 5.8S ribosomal RNA gene, and internal transcribed spacer 2, complete sequence | 1.8 | 2635909 | (Z99121) permease [Bacillus subtilis] | 1.9






Human polymorphic 










105 


AJOUjj 


Mspl site DNA 

\L/JJJ IOCU5/ 


1 8 


^136878 


keratin KAP5.5 - sheep 
(fragment} >«?ii3 13722 


0.65 


106 


TT 1 S7RH 


Human p82 (ST5) 
mRNA, alternatively 


1 8 


3638957 


(AC004877) sco-spondin-raucin- 
like; similar to P98167 uncertain 
fHomo saoiensl 


0.64 


107 




Homo sapiens 
synaptotagmin VII 

mCNlA nirtttl rHc 
ITLlVJ^t r\, llui tlij 


] 8 


457927 


(U00690) calcium channel alphai 
1 subunit [Drosophila 
melanocasterl 


051 


108 




Homo sapiens clone 
23585 mRNA 
sequence 






HOMEOBOX PROTEIN HOX- 
D 1 (HOX-4 9) 


0 28 


109 


X75208 


H.sapiens HEK2 
mRNA for protein 
tyrosine Kjnabc 
receptor. 


1.8 


1730198 
— — . — 


GROWTH- ARREST-SPECIFIC 
PROTEIN 1 <>ene oroduct 
[Homo sapiens] 


0.22 


110 


ABO 13896 


Xenopus laevis 
mRNA for SOX-D, 
complete cds 


1.8 


2494501 


TRANSCRIPTION FACTOR 
FKH-4 factor [Mus muscuius] 


0.17 


111 


D 16947 


Human HepG2 3' 
region cDNA, clone 
hmd6bl0 


1.8 


3413870 


(AB007923) KI A A0454 protein 
[Homo sapiensl 


0.002 


1 12 


D 1 3547 


Mouse L/iN/\, i eany 
alpha (TEA) region 


1.8 


3393018 


(A\ 0T1 174} hvnothetical 
protein 


5e-08 


113 


M35498 


Woodchuck c-myc 
protein gene, exon 1 . 


l.S 


3183405 


HYFU'lHfclRJAL 11.3KD 
PROTEIN C2C6.07 IN 
CHROMOSOME I 
>gi|23705O4|gnl|PID|e339 1 94 
pombe] 

>oii345 1 305l2nllPID!e 13 16730 
(AL031324) very hypothetical 
protein [Schizosaccharomyces 
pombe] 


Se-10 


114 


M84166 


t-fnmctpr f*-Wfi»rn^ 

protein gene, 
complete cds. 


1.8 


3386622 


(AC004665) unknown protein 
f Arabidopsis thaliana] 


2e-l0 


115 


U33135 


Mychodea carnosa 
18S ribosomal RNA 
gene, complete 
sequence 


1.8 


3334982 


(AC005306) R27216_l [Homo 
sapiens] 


3e-22 


116 


U84003


Homo sapiens 
putative tumor 
suppressor (BIN1)
gene, exons 7-12 


1.7 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















117 


AE001121 


Borrelia burgdorferi 
{section / oi oi 
the complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


118 


AE001114


Archaeoglobus 
fulgidus section 165 
of 172 of the
complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


119 


U82064 


Angiostrongylus 

cantonensis adult-
specific muscle
protein-1 gene, partial
cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


120 


AF041836 


Buchnera aphidicola 
plasmid pLeu-Sg,
complete plasmid 
sequence 


1.7 


<NONE> 


<NONE> 


<NONE> 


121 


M87479 


Lymnaea stagnalis
FMRFamide gene,
mature peptides. 


1.7 


<NONE> 


<NONE> 


<NONE> 


122 


M55163 


Xenopus laevis
fibroblast growth
factor receptor
mRNA, complete cds.


1.7


<NONE>


<NONE>


<NONE>

123 


S57565 


histamine H2- 
receptor [rats,
Genomic, 1928 nt]


1.7 


<NONE> 


<NONE> 


<NONE> 




124


M27256


Simian 

immunodeficiency 
virus (SIV) pol 
region. 


1.7 


<NONE> 


<NONE> 


<NONE> 


125 


U31516 


Human chromosome 
8 anonymous clone 
pBS8-165 


1.7 


<NONE> 


<NONE> 


<NONE> 


126 


X12671 


Human gene for
nuclear

ribonucleoprotein
(hnRNP) core protein
A1


1.7 


<NONE> 


<NONE> 


<NONE>


127 


AF009054 


Paeonia suffruticosa
ssp. spontanea
alcohol

dehydrogenase 1B
(Adh1B) gene, partial
cds 


1.7 


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















128 


AF046917 


Mus musculus
transketolase gene, 
exon 6 and partial cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


129 


D89053 


Homo sapiens mRNA 
for Acyl-CoA 
synthetase 3, 
complete cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


130 


Uj /7D0 


Staphylothermus 
marinus surface layer-
associated STABLE 
protease gene, 


1.7


<NONE> 


<NONE> 


<NONE> 


131 


L39072 


Bovine herpesvirus I 
(clone p95) UL24 

complete cds. 


1.7 


<NONE> 


<NONE> 


<NONE> 


132 




Drosophila simulans 
retrotransposon 297 
5'-LTR and flanks


1.7


<NONE> 


<NONE> 


<NONE> 


133 


AE001114


Archaeoglobus 
fulgidus section 165
of 172 of the 


1.7


<NONE> 


<NONE> 


<NONE> 


134 


X04434 


Human mRNA for 
insulin-like growth 
factor I receptor 


1.7 


<NONE> 


<NONE> 


<NONE> 


135 


U07890 


Mus musculus 
C57BL/6J epidermal
surface antigen
(mesa) mRNA,
complete cds.


1.7 


<NONE> 


<NONE> 


<NONE> 


136 


D26163 


Human tyrosinase 
gene, 5'-flanking
region (cell-specific
transcription)


1.7 


<NONE> 


<NONE> 


<NONE> 


137 


AF093818 


Panorpa nipponensis 
NADH 

dehydrogenase 
subunit 5 gene, 
mitochondrial gene 
encoding 
mitochondrial 
protein, partial cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Xenopus laevis 










138 


D50560 


mRNA for
cytochrome P-450, 
complete cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


139 


AF08348S 


Mus musculus 
phospholipase D1
(PLD1) gene, exons
18 and 19, complete
sequence 


1.7 


<NONE> 


<NONE> 


<NONE> 


140 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


1.7 


<NONE>


<NONE> 


<NONE> 


141 


M73749 


Streptococcus 
salivarius

thermophilus beta-D- 
galactose (lacZ) gene, 
complete cds. > :: 
gb|M63636|STRJLAC 
ZZ Streptococcus 
thermophilus beta-D-
galactosidase (lacZ) 
gene, complete cds. 


1.7 


<NONE> 


<NONE> 


<NONE> 


142 


AE001114


Archaeoglobus 
fulgidus section 165 
of 172 of the 
complete genome 


1.7 


2183023 


(U84971) unknown [Homo 
sapiens] 


9.2 


143 


L01983 


Human type IV 
sodium channel alpha 
polypeptide 


1.7 


130504 


GENOME POLYPROTEIN
[CONTAINS: N-TERMINAL
PROTEIN (P1); HELPER
COMPONENT PROTEINASE 
INCLUSION PROTEIN (CI); 6 
KD PROTEIN 2 (6K2); 
GENOME-LINKED PROTEIN 
(VPG); NUCLEAR ... virus 
(strain D) 


9.2 


144 


L19731 


Plecotus rafinesquii 
mitochondrial 
cytochrome b gene, 5* 
end. 


1.7 


3327096 


(AB014541) KIAA0641 protein
[Homo sapiens] 


9.1 


145 


AE001114


Archaeoglobus 
fulgidus section 165
of 172 of the 
complete genome 


1.7 


2183023 


(U84971) unknown [Homo 
sapiens] 


8.8 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















146 


L27218 


Bos taurus serum
amine oxidase
mRNA, complete cds.
> oxidase=amiloride- 
binding protein 
homolog [cattle, liver, 
mRNA, 2664 nt] 


1.7 


1174459 


SIGNAL TRANSDUCER AND 
ACTIVATOR OF 
TRANSCRIPTION 6 (IL-4 
STAT) >gi|559855 (U16031) IL- 
4 Stat [Homo sapiens] 


7.1 


147 


Z49868 


Caenorhabditis 
elegans cosmid 
W07E1 Incomplete 
sequence 
[Caenorhabditis 
elegans]


1.7 


4204263 


(AC005223) 40409 
[Arabidopsis thaliana] 


6.7 


148 


AL022271 


Caenorhabditis 
elegans cosmid 
F32F2, complete
sequence
[Caenorhabditis
elegans]


1.7 


2497969 


PERIPLASMIC NITRATE 
REDUCTASE PRECURSOR 
>gi|1086107|pir||S50163 nitrate
reductase large chain precursor, 
periplasmic - Thiosphaera 
pantotropha >gi|600093 
(Z36773) periplasmic nitrate 
reductase large subunit 
[Paracoccus denitrificans] 


6.7 


149 


U43844 


Mus musculus cyclin
D3 gene, complete 
cds 


1.7 


3861490 


(AF062037) capsid protein 
precursor [Thosea asigna virus] 


5.1 


150 


Z25464 


S.cerevisiae UNF1, 
LTV1,MRP8,CYB3 
and TGL 1 genes, 
complete CDS's 


1.7 


1255404 


(U53151) weak similarity to 
cytochrome b [Caenorhabditis 
elegans]


4.1 


151 


U77846 


Human elastin gene, 
partial cds and partial 
3'UTR 


1.7 


3355682 


(AL031124) putative secreted 
ivase 


4.0 


152 


X62880 


S.scrofa mRNA for 
calcium release 
channel (CRC) 


1.7 


3327080 


(AB014533) KIAA0633 protein
[Homo sapiens] 


4.0 


153 


Y00067 


Human gene for 
neurofilament subunit 
M (NF-M)


1.7 


479829 


heterogeneous ribonuclear 
particle protein homolog -
Caenorhabditis elegans 
similarity to RNA recognition 
motifs [Caenorhabditis elegans] 


3.9 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 

ID


ACCESSION 


DESCRIPTION 


P VALUE 




DESCRIPTION 


P VALUE 
















154 


X68393 


D.melanogaster gene 
for Beta-tubulin,
exons 1 and 2 


1.7 


2342682 


(AC000106) Contains similarity
to Rattus AMP-activated protein
kinase (gb|X95577).
[Arabidopsis thaliana]


3.8 


155 


AB012284 


Shuttle vector 
pAUR123 gene for 
Aur1-C, complete cds


1.7 


417704 


POL POLYPROTEIN 
(ORF1A/1B) [CONTAINS:
RNA-DIRECTED RNA
POLYMERASE; HELICASE;
PROTEASE]


3.8 


156 


M96633 


Rattus norvegicus 
mitochondrial 
intermediate 
peptidase (MIP)
mRNA, complete cds.


1.7 


2314209 


(AE000613) H. pylori predicted 
coding region HP1054


3.1 


157 


U49055 


Rattus norvegicus 
CTD-binding SR-like
protein rA8 mRNA, 
complete cds 


1.7 


2497252 


INSULIN-LIKE GROWTH
FACTOR BINDING PROTEIN
4 (IGFBP-4) (BP-4) (IGF-
BINDING PROTEIN 4) factor- 
binding protein-4 - sheep 
(fragment) factor-binding 
protein-4, IGFBP-4 [sheep, 
liver, Peptide, 237 aa] [Ovis 
aries] 


3.0 


158 


Y15907


Mus musculus mRNA 
for myc-intron-
binding protein-1


1.7 


912776 


iduronate-2-sulfatase, IDS (EC
3.1.6.13) Peptide Mutant, 550 
aa] 


3.0 


159 


U67600 


Methanococcus 
jannaschii section 142 
of 150 of the 
complete genome 


1.7 


2982355 


(AF052252) fork head domain 
protein FKD9 [Danio rerio] 


3.0 


160 


AF013759


Homo sapiens 
calumenin (Calu)
mRNA, complete cds


1.7 


2982355 


(AF052252) fork head domain 
protein FKD9 [Danio rerio]


2.9 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












V*u.^wuj jinmarity lu - ■ 




161 


AF062915 


Arabidopsis thaliana
putative transcription
factor (MYB90)
mRNA, complete cds


1.7



3878065 


Human mRNA product
KIAA0077 (TR:Q14997);
cDNA EST yk243h8.5 comes
from this gene; cDNA EST
yk243h8.3 comes from this
gene; cDNA EST yk359h4.5
comes from this gene
[Caenorhabditis elegans]
>gi|3880318|gnl|PID|e1349839
(Z81133) Similarity to Human
mRNA product KIAA0077
(TR:Q14997); cDNA EST
yk243h8.5 comes from this
gene; cDNA EST yk243h8.3
comes from this gene; cDNA
EST yk359h4.5 comes from this
gene


2.3


162 


X87526 


H.sapiens genomic 
DNA (chromosome 
3; clone NL3003R) 


1.7 


3638957 


(AC004877) sco-spondin-mucin- 
like; similar to P98167 uncertain 
[Homo sapiens]


2.3 


163 


AC005573 


Homo sapiens 
chromosome 5, PAC 
clone 202el3 


1.7 


2465540 


(AF005632) phosphodiesterase 
I/nucleotide pyrophosphatase
beta [Homo sapiens] 


1.8 


164 


D83402 


Homo sapiens gene 
for prostacyclin 
synthase, exon 10 and 
complete cds 


1.7 


627608 


steroid hormone receptor TR3 - 
human sapiens) 


1.7 


165 


AF053700


Homo sapiens deltex 
(Dx) mRNA, 
complete cds 


1.7 


2662089 


(AB007864) KIAA0404 [Homo 
sapiens] 


1.7 


166 


AF043225 


Mus musculus 6- 
pyruvoyl- 
tetrahydropterin 
synthase (Pts) 
mRNA, complete cds


1.7 


2352538 


(AF006564) alcohol 
dehydrogenase [Drosophila 
persimilis]


1.4 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















167 


U52917 


Thermus aquaticus
thermophilus NADH
dehydrogenase I
subunits NQO7,
NQO6, NQO5,
NQO4, NQO2,
NQO1, NQO3,
NQO8, NQO9,
NQO10, NQO11,
NQO12, NQO13, and
NQO14, complete
cds.


1.7 


2564334 


(AB006631) The human
homolog of mouse Cux-2 
[Homo sapiens] 


1.0 


168 


X72222 


M.musculus gene for
serotonin 2 receptor 


1.7 


3875796 


similarity to yeast
hypothetical YIK9 protein
(SW:YIK9_YEAST); cDNA
EST EMBL:T01252 comes 
from this gene; cDNA EST 
EMBL:D33205 comes from this 
gene; cDNA EST 
EMBL:D33955 comes from this 
gene; cDNA EST 
EMBL:D35484co... 


1.0 


169 


U23186 


Crotalus scutulatus 

PLA2-like 

pseudogene 


1.7 


853971 


(X83413) DR5 [Human
herpesvirus 6] >gi|853972
(X83413) DR5 [Human
herpesvirus 6]


0.99 


170 


M83118 


Mus musculus factor 
VIII-associated
protein (f8a) mRNA, 
complete cds. 


1.7 


3201617 


(AC004669) hypothetical 
protein [Arabidopsis thaliana] 


0.80 


171 


M38347 


E.coli ATP-
dependent proteinase
(lon) gene, complete
cds. 


1.7 


4140322 


rAL0312m>dJ2B3E3J.2(Cell 

Division Cycle 2-Like 2 
(PITSLRE, p58/GTA,
Galactosyltransferase 
Associated Protein Kinase)) 
(isoform beta 2-2) [Homo 
sapiens]


0.78 


172 


U28838 


Human transcription 
factor TFIIIB 90 kDa 
subunit


1.7 


2495730 


HYPOTHETICAL PROLINE-
RICH PROTEIN KIAA0269
>gi|1665805|gnl|PID|d1014089
(D87459) Similar to Volvox
carteri extensin (S22697)
[Homo sapiens]


0.62 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















173 


U72487 


Rattus norvegicus 
calcium-independent 
alpha-latrotoxin
receptor mRNA, 
complete cds 


1.7 


544411


GLYCOPROTEIN GP100 
PRECURSOR (P29F8) 
discoideum]


0.35 


174 


AE000718


Aquifex aeolicus 
section 50 of 109 of
the complete genome 


1.7 


2497569 


FIBROBLAST GROWTH 
FACTOR RECEPTOR 3 
PRECURSOR (FGFR-3) 
(HEPARIN-BINDING 
GROWTH FACTOR 
RECEPTOR) 
>gl|2l 1 /oj l|pir|jI!5D ioj 
fibroblast growth factor receptor 
3 - mouse >gi|199145 (M81342)
fibroblast growth factor receptor
3 [Mus musculus]


0.34 


175 


AF016897 


Oryza sativa GDP 
dissociation inhibitor 
protein OsGDI2 
(OsGDI2) mRNA,
complete cds 


1.7 


125362 


MACROPHAGE COLONY
STIMULATING FACTOR I
RECEPTOR PRECURSOR
(CSF-1-R) (FMS PROTO-
ONCOGENE) (C-FMS) factor 1
receptor - cat >gi|163855
(J03149) M-CSF receptor [Felis
domesticus]


0.34 


176 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 90 
mRNA, complete cds 


1.7 


85058 


muscarinic acetylcholine 
receptor - fruit fly acetylcholine 
receptor [Drosophila 
melanogaster]


0.20 


177 


AF077352 


Chlamydomonas 
reinhardtii myosin 
heavy chain


1.7 


728901 


ACROSOMAL PROTEIN 

western baboon 
>gi|298488|bbs|127113
(S56458) SP-10=intraacrosomal 
protein [Papio papio=baboons. 
Peptide. 285 aa] [Papio 
hamadryas]


0.20 


178 


Z92788 


Caenorhabditis 
elegans cosmid 
F53B8, complete
sequence
[Caenorhabditis
elegans]


1.7 


746516 


(U23517) D1022.7
[Caenorhabditis elegans]
>gi|3258651 elegans]


0.068 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non- Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(X87883) mitochondrial capsule 




179 


AF002217 


Ralstonia eutropha
megaplasmid pHG1
nitric oxide reductase 
(norB) gene, 
complete cds 


1.7 


1143538 


selenoprotein [Rattus 
norvegicus] >gi|1354135
(U48702) mitochondria
associated cysteine-rich protein 
SMCP 


0.039 


180 


D30749 


Rat mRNA for 
protein tyrosine 
phosphatase 


1.7 


1228035 


(D83776) The KIAA0191 gene 
is expressed ubiquitously; The
KIAA0191 protein retains the 
C2H2 zinc-finger at its N- 
terminal region. [Homo sapiens] 


0.008


181 


M15202 


Rat fast skeletal TnT

gene encoding 
troponin T isoforms, 
complete cds. 


1.7 


731172 


SKIN SECRETORY PROTEIN 
XP2 PRECURSOR 


4e-04 


182 


L07592 


Human peroxisome 
proliferator activated 
receptor mRNA, 
complete cds. 


1.7 



4033414 


PUTATIVE IMPORTIN BETA- 
4 SUBUNIT


2e-06 


183 




Dendrobium 
crumenatum ACC 
synthase gene. 


1.7




ASPARTYL-TRNA 
SYNTHETASE synthetase 

[Bacillus subtilis]




184 


AF034970 


Homo sapiens 
docking protein 
(DOK-2) mRNA, 
complete cds 


1.7 


2289097 


(U78737) 

alpha( 1 ,3)fucosyltransferase 
[Cricetulus griseus] 


8e-12






L.longiflorum mRNA
encoding calmodulin. 










185 


Z12839


> :: 

gb|L18912|LILCALM
ODU Lilium
longiflorum 
calmodulin mRNA, 
complete cds. 


1.7 


2511747


(AF023270) probable 
transcriptional regulator dre4 


4e-12 



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SEQ 
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



186 



187 



188 



189 



DESCRIPTION 



X53459 



K02668 



AB008375 



L36603 



Equine arteritis virus 
(EAV) RNA genome 

> :: 

emb|A45589|A45589 
Sequence 1 from
Patent WO9519438 >

emb|A58849|A58849
Sequence 1 from
Patent WO9700963 >

gb|AR013959|AR013959
Sequence 1 from
patent US 5773235



E. coli ddl gene
encoding D-alanine:D-
alanine ligase and
ftsQ and ftsA genes,
complete cds, and
ftsZ gene, 5' end.



Homo sapiens mRNA
for osteoblast specific
cysteine-rich protein,
complete cds



Pseudomonas cepacia
(clone Psudom70-1)

heat shock protein 70 

(hsp70) gene. 

complete cds 



1.7 



P VALUE 



3979817 



1.7 



1.7 



1.7 



3879121 



2496945 



2661842 






(Z70033) Weak similarity to
Human tyrosine-protein kinase
CSK (SW:CSK_HUMAN);
cDNA EST EMBL:C10908
comes from this gene; cDNA
EST EMBL:C12822 comes
from this gene; cDNA EST
yk408c2.3 comes from this
gene; cDNA EST yk408c2.5 .
Human tyrosine-protein kinase
CSK (SW:CSK_HUMAN);
cDNA EST EMBL:C10908
comes from this gene; cDNA
EST EMBL:C12822 comes
from this gene; cDNA EST
yk408c2.3 comes from this
gene; cDNA EST yk408c2.5 .



(Z70310) predicted using
Genefinder; Similarity to Mouse
ankyrin (PIR Acc. No. S37771);
cDNA EST EMBL:T01923
comes from this gene; cDNA
EST EMBL:D32335 comes

from this gene; cDNA EST
EMBL:D32723 comes from this
gene; cDNA ES... Genefinder;
Similarity to Mouse ankyrin

(PIR Acc. No. S37771); cDNA
EST EMBL:T01923 comes

from this gene; cDNA EST
EMBL:D32335 comes from this
gene; cDNA EST
EMBL:D32723 comes from this
gene; cDNA ES...



HYPOTHETICAL 55.9 KD 
PROTEIN EEED8.6 IN 
CHROMOSOME II >gi[733603 
(U23484) No definition line 
found [Caenorhabditis elegans]



(Y15732) DNA polymerase beta
[Xenopus laevis]



1e-14



2e-19 



1e-19



6e-20 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 






P VALUE






P VALUE












HYFOTHJbTHJAL /w KD 




190 


Z49760 


P.blakesleeanus
mRNA GTP 
cyclohydrolase I 


1.7


1731181 


PROTEIN C14A4.!J IN 

CHROMOSOME II
>gi|3874230|gnl|PID|e1351618
protein (Swiss Prot accession
number P38376); cDNA EST
yk220e10.5 comes from this
gene [Caenorhabditis elegans] 


3e-21 


191 


U52428 


Human fatty acid
synthase gene, partial 
cds 


1.7 


4226073


(AF125443) contains similarity
to S. pombe phosphatidyl 
synthase (GB:Z28295) 
[Caenorhabditis elegans] 


6e-25 


192 


U12767


Human mitogen 
induced nuclear 
orphan receptor 


1.6 


<NONE>


<NONE> 


<NONE> 


193 


Z63478 


H.sapiens CpG DNA, 
clone 85a12, forward
read cpg85a12.ft1a.


1.6 


<NONE> 


<NONE> 


<NONE> 


194 


AF084375 


Homo sapiens 
inversin protein, 
exons 8 and 9 


1.6 


<NONE> 


<NONE> 


<NONE> 


195 


AE001114


Archaeoglobus 
fulgidus section 165 
of 172 of the 
complete genome 


1.6 



<NONE> 


<NONE> 


<NONE> 


196 


AF084375 


Homo sapiens 
inversin protein, 
exons 8 and 9 


1.6 


<NONE> 


<NONE> 


<NONE> 


197 


U24217 


Kluyveromyces lactis 
RNA polymerase II 
largest subunit gene,
partial cds 


1.6 


<NONE> 


<NONE> 


<NONE> 


198 


AE000580 


Helicobacter pylori 
26695 section 58 of 
134 of the complete 
genome 


1.6 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















199 


X62083 


H.sapiens mRNA for 
Drosophila female 
sterile homeotic 
(fsh) homologue > ::
gb|M80613|HUMFS
HG Human homolog 
of Drosophila female 
sterile homeotic 
mRNA, complete cds. 


1.6 


<NONE> 


<NONE> 


<NONE> 


200 


M28064 


Plasmodium 
brasilianum DNA 
homologous to the 
histidine-rich knob 
protein region of 
Plasmodium 
falciparum. 



1.6 


457495 


(M26647) ORF X 
[Saccharomyces cerevisiae] 


8.4 


201 


U03114 


Streptomyces albus
lipase precursor (lip) 
gene, complete cds, 
and unidentified 5' 
ORF and 3' ORF. 
partial cds. 


1.6 


3638957 


(AC004877) sco-spondin-mucin-
like; similar to P98167 uncertain 
[Homo sapiens] 


7.8 


202 


U88422 


Strix varia oocyte 
maturation factor 
Mos (c-mos) proto- 
oncogene, partial cds


1.6 


137618 


VITAMIN D3 RECEPTOR 
(VDR) receptor [Rattus 
norvegicus] 


6.4 


203


M68519 


Human pulmonary 
surfactant-associated 
protein SP-A 
(SFTP1) gene, 
complete cds. 


1.6


3875423


(Z38112) E03A3.6
[Caenorhabditis elegans] 


4.9 


204 


AF044575 


Homo sapiens 
transcription factor 
POU4F3 


1.6


2133625 


GABA transport protein -
tobacco hornworm 


4.7 


205 


L48476 


Homo sapiens 
(subclone 3_eI0 from 
PI H21) DNA 
sequence. 


1.6 


3687297 


(AJ005588) 5-epi-aristolochene 
synthase 


4.6 


206 


M18630 


Rat CNS 2',3'-cyclic
nucleotide 3'-
phosphodiesterase


1.6 


3880315 


(Z81133) Similarity to Human
mRNA product KIAA0077
(TR:Q14997) [Caenorhabditis
elegans]


3.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















207 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete
cds 


1.6 


267068 


TUMOR-ASSOCIATED
ANTIGEN L6 


3.6 


208 


U53448 


Babesia microti heat 
shock protein 70 
(hsp70) gene, 
complete cds 


1.6 



1255429 


(U53155) strong similarity to 
the carboxyl two-thirds of valyl-
tRNA synthetases 
[Caenorhabditis elegans] 


2.2 


209 


AF084367 


Homo sapiens 
inversin protein 
mRNA, complete cds 


1.6 


1730076 


PROBABLE 
SERINE/THREONINE- 
PROTEIN KINASE CY49.28 
>gi|1370255|gnl|PID|e247094
(Z73966) pknJ 


1.2 


210 


D55635 


Yeast dis1+ gene for
p93dis1, complete
cds 


1.6 


3128353 


(AF010496) maltose transport
inner membrane protein 


1.2 


211 


AF035756 


Streptomyces sp. 2- 
dehydro-3- 
deoxyphosphoheptonate
aldolase gene,
partial cds


1.6 


853971 


(X83413) DR5 [Human 
herpesvirus 6] >gi|853972 
(X83413) DR5 [Human 
herpesvirus 6] 


0.97 


212 


X73479 


O.cuniculus rPTPA
mRNA 


1.6 


3413810 


(Y17034) Bassoon [Mus 
musculus]


0.94 


213 


X98330 


H.sapiens mRNA for 
ryanodine receptor 2 


1.6 


2072986 


(U95142) putative G-protein-
coupled receptor G-protein-
coupled receptor [Arabidopsis
thaliana]


0.73 


214 


X64194 


P.anserina FMR1
gene exons 1 and 2


1.6 


128014 


NECDIN >gi|91129|pir||JN0148
necdin, brain - mouse
>gi|200020 (M80840) necdin
[Mus musculus]


0.42 


215 


Z92788


Caenorhabditis
elegans cosmid
F53B8, complete
sequence
[Caenorhabditis
elegans]


1.6 


746516 


(U23517) D1022.7
[Caenorhabditis elegans]
>gi|3258651 elegans]


0.19 


216 


AE000888 


Methanobacterium 
thermoautotrophicum 
from bases 1098908 
to 1112186 (section 
94 of 148) of the 
complete genome


1.6 


462415 


INTERFERON-ALPHA/BETA
RECEPTOR ALPHA CHAIN
PRECURSOR (IFN-ALPHA-
REC) >gi|346520|pir||S27387
interferon alpha receptor type 1 -
bovine >gi|432


0.001 



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SEQ 
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



217 AB008375



218 M25312



219 AB012882



220 U29487 



221 X74760 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



Homo sapiens mRNA 
for osteoblast specific 
cysteine-rich protein, 
complete cds 



Orang-utan involucrin 
gene, complete cds. 



Cyprinus carpio 
mRNA for MyoD, 
complete cds 



1.6 



1.6 



Caenorhabditis 
elegans cosmid
C09C7 



1.5 



1.5 



M.musculus mRNA
for Notch 3



222 U72396



Lycopersicon 
esculentum class II 
small heat shock 
protein Le-HSP17.6
mRNA, complete cds 



223 U42391



224 M92296 



225 X94144



Human myosin-IXb
mRNA, complete cds 

Pongo pygmaeus 
gamma-1 and gamma-
2 globin genes,
complete cds. 



1.5 



1.5 



C.japonica mRNA for
QNR-71 protein 



DESCRIPTION 



P VALUE 



2496945 



1.5 



1.5 



3875131 



HYPOTHETICAL 55.9 KD

PROTEIN EEED8.6 IN 

CHROMOSOME II >gi|733603 

(U23484) No definition line 

found [Caenorhabditis elegans] | 1e-18
(Z70750) similar to vanadate

resistance protein 

transmembranous domains 

[Caenorhabditis elegans] | 3e-26 



1.5 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



1364094 



integral membrane protein - 

Streptomyces pristinaespiralis

>gi|872306 (X84072) integral

membrane protein

[Streptomyces pristinaespiralis]
EXOGLUCANASE II



4.3 



121855 



3688428 



186413 



2745737 



PRECURSOR cellulose 1,4-beta 
cellobiosidase (EC 3.2.1.91) II
precursor - fungus (Trichoderma
reesei) 1,4-beta-cellobiosidase
(EC 3.2.1.91) II - fungus
cellobiohydrolase II 
[Trichoderma reesei] 



(AJ011534) sucrose synthase



(M13144) inhibin A [Homo
sapiens] 



(AF029791) UDP- 
Gal:betaGlcNAc beta 1,3- 
galactosyltranferase-II [Mus 
musculus] 



4.3 



4.2 



0.22 



3e-08 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















226 


AB014557 


Homo sapiens mRNA 
for KIAA0657 
protein, partial cds


1.5


1 2 1 ^092 


(X90568) Protein sequence and 
annotation available soon via 
Swiss- Prot; available at present 
via e-mail from 
LABEIT@EMBL- 


*te- u 


227 


AF000948 


Borrelia burgdorferi 
oligopeptide 
permease homolog
OppAIV (oppAIV) 
gene, complete cds 


1.3 


<NONE>


<NONE> 


<NONE> 


228 




Mus musculus 
RAB/Rip protein 


i i 

i -j 




MYC PROTO-ONCOGENE
PROTEIN (C-MYC) proto-
oncogene [Sus scrofa]


2.0 


229 


U38951


Drosophila 

melanogaster

vacuolar ATPase 
subunit E 


1.1 


<NONE> 


<NONE> 


<NONE> 


230 


AF027148 


myogenic 

determining factor 3 


1.1


3172134 


(~[ TOO'? HO \ DMA nnlvm»riCA TT 

yuyvzuzr) t\sss\ polymerase ii 
largest subunit [Bonnemaisonia 
hamifera]


2.3 


231 


AF079310 


Mus musculus histone 
deacetylase 3
(Hdac3) gene, exons
4 through 15 and
complete cds


1.0 


1657601 


(U66220) unknown 

Nannncviti^ eteHervtl 




232 


X52134 


P.radiata lac gene for 
laccase 


0.95 


996020 


(X91638) BRM protein [Gallus 
gallus]


0.31 


233 


D89016 


Human mRNA for 
Neuroblastoma, 
complete cds 


0.93 


<NONE> 


<NONE> 


<NONE> 


234 


X76392 


C.familiaris VIP36
(vesicular integral- 
membrane protein of 
36 kDa) mRNA 


0.93 


4176446 


(AL022238) dJ1042K10.2.1
(novel protein with probable
rabGAP domains and Src
homology domain 3)


7e-81


235 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.90 


<NONE>


<NONE> 


<NONE> 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












" bb 1 2 FRU 1 hlN FKliLUKJ>UK 




236 


AE000991 


Archaeoglobus
fulgidus section 116
of 172 of the 
complete genome 


0.90 


1176579 


fLAKLY (J I niANSonprrr 

>gi|1362345|pir||S55862 
probable membrane protein 
YNL327w - yeast 
(Saccharomyces cerevisiae) 
cerevisiae]

>gi|1302445|gnl|PID|e239572
(Z71603) ORF YNL327w
[Saccharomyces cerevisiae]


6.9 


237 


Z35922 


S.cerevisiae 
chromosome II 
reading frame ORF 
YBR053c 


0.86


<NONE> 


<NONE> 


<NONE> 


238 


U47331 


Rattus norvegicus 
metabotropic 
glutamate receptor 4b
mRNA, complete cds.


0.82 


1550703 


(Z80225) hypothetical protein 
Rv2662 


4.1 




239


X72810


H.sapiens Ig germline
kappa-chain gene 
variable region (L3) 


0.69 


3023063 


(AF052587) F14 [Xylella 
fastidiosa] 


6.7 


240 


Z11700


Escherichia coli 
genes faeG, faeH, 
faeI, faeJ and IS629-
like insertion 
sequence. > :: 

HIJ E.coli faeH, faeI
and faeJ genes
encoding FaeH, FaeI
and FaeJ proteins 


0.69 


2347188


(AC002338) laccase isolog
[Arabidopsis thaliana]


3.9 


241 


U71597 


Phrynosoma
douglassii NADH
dehydrogenase
subunit 4 (ND4)
gene, mitochondrial
gene encoding
mitochondrial
protein, partial cds 


0.65 


<NONE> 


<NONE> 


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












GLUTAMYL-TRNA




242 


Z77798 


Ammonia species 
LSU rRNA gene 
(partial; isolate Tr S 
5" clone 16^ 


0.64 


1174506 


SYNTHETASE glutamate- 
tRNA ligase (EC 6.1.1.17)- 
Haemophilus influenzae (strain 
Rd KW20) >gi|1573240
(U32713) glutamyl-tRNA
synthetase (gltX) [Haemophilus
influenzae Rd]


1.2


243 


D25542 


Human mRNA for 
golgi antigen gcp372, 
complete cds 


0.64 


111230


ultra-high-sulfur keratin 1 -
mouse 


1e-05


244 


M80234 


Cow dopamine 

transporter mRNA,

putative cds. 


0.64 


3874972 


(Z99709) similar to Elongation 
factor Tu family (contains 
ATP/GTP binding P-loop); 
cDNA EST EMBL:D76223 
comes from this gene; cDNA 

P^X vk47RcS S enmp^ frnm thi< 

gene [Caenorhabditis elegans] 


8e-06 


245 


AB007918


Homo sapiens mRNA 
for KIAA0449 
protein, partial cds 


0.64 


2833239 


EPIDERMAL GROWTH
FACTOR RECEPTOR
KINASE SUBSTRATE EPS8
>gi|530823 (U12535) epidermal
growth factor receptor kinase
substrate [Homo sapiens]


2e-14


246 


X51754 


Human U266 
rearranged DNA for 
lambda-
immunoglobulin light
chain


0.63 


2072301 


(U95102) mitotic 
phosphoprotein 90 [Xenopus
laevis]


1.5 


247 


AE001554 


Helicobacter pylori, 
strain J99 section 115
of 132 of the
complete genome


0.62 


<NONE> 


<NONE> 


<NONE> 


248 


Z64067 


H.sapiens CpG DNA, 
clone 96e7, reverse
read cpg96e7.rt1a.


0.62 


<NONE> 


<NONE> 


<NONE> 


249 


AJ223768 


Pinus sylvestris 
microsatellite DNA, 
clone SPAC11.5


0.62 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 



DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















250 


AJ011592 


Bacteriophage P1 ban
gene


0.62 


2493689 


PHOTOSYSTEM II 10 KD
PHOSPHOPROTEIN
>gi|2143326|gnl|PID|e319090
(Y13328) 10kDa
phosphoprotein [Populus
deltoides]


7.9 


251


AF027151 


survival of motor 
neuron protein 
interacting protein 1 
(SIP1) mRNA, 
complete cds 


0.62 


4007790 


( AL034463) putative single- 
strand polynucleotide binding 
protein [Schizosaccharomyces 
pombe] 


2.0 


252 


AJ000376


Helobdella triserialis 
mRNA for actin 


0.62 


1117968 


(U40763) CARS-Cyp [Homo 
sapiens]


0.90 


253


M69231 


Rat thymosin beta 4 
gene (pTB4G), intron.


0.62 


4176370 


(AC005058) similar to calcium- 
independent phospholipase A2;
similar to AC004392 
(PID:g3367519) [Homo 
sapiens] 


6e-51 


254


AB021638 


Homo sapiens X11L2
mRNA for X11-like
protein 2, complete 
cds 


0.61 


<NONE> 


<NONE> 


<NONE> 


255 


D26470 


Bacteroides 
gingivalis DNA for
arginyl 

endopeptidase, 
complete cds 


0.61 


<NONE> 


<NONE> 


<NONE> 


256 


J04737 


A.thaliana ATPase 
gene, complete cds. 


0.61 


<NONE> 


<NONE> 


<NONE> 


257 


U06756 


Bos taurus clone 
brnl308 

microsatellite and are- 
lp repeat region. 


0.61 


1922280 


(Y09905) snail like protein 
[Gallus gallus]


0.51 


258 


S75756 


p15=cyclin D-
dependent kinases 4
and 6-binding
protein/p15 product
{exon/intron 1}
[human, brain tumors,
Genomic, 753 nt]


0.61 


484938 


hypothetical protein 253 - 
Streptomyces griseus plasmid 
pSG1 (fragment)


0.13 


259 


L39837 


Drosophila 
melanogaster tumor 
suppressor (warts)
mRNA exons 1-8, 
complete cds. 


0.61 


3875131


(Z70750) similar to vanadate
resistance protein
transmembranous domains
[Caenorhabditis elegans]


1e-09


(AF125443) contains similarity




260 


U52428 


Human fatty acid 
synthase gene, partial 
cds 


0.61 


4226073 


to S. pombe phosphatidyl 
synthase (GB:Z28295) 
[Caenorhabditis elegans]


2e-26


261 


X15292


Plasmodium 
falciparum gene for 
heat-shock protein 
pPf203 


0.60 


<NONE> 


<NONE> 


<NONE> 


262 


AB020663 


Homo sapiens mRNA 
for KIAA0856 
protein, partial cds 


0.60 


470341 


(U00043) No definition line 
found [Caenorhabditis elegans] 


5.7


263 


U68723


Human checkpoint
suppressor 1 mRNA,
complete cds 


0.60 


544375 


GALACTOSE-BINDING
PROTEIN REGULATOR
glucose/galactose binding
protein regulator -
Agrobacterium tumefaciens
>gi|142228 (L10424)
glucose/galactose binding 
protein regulator 


5.7 


264 


M32687 


S.griseus sporulation 
protein genes 1590 
and 1422. 


0.60 


2582017 


(AF012871) Merg1a' [Mus
musculus]


3.3 


265 


AJ005331 


Homo sapiens 
NKCC2 gene, exon 4, 
isoform B 


0.60 


3128353 


(AF010496) maltose transport
inner membrane protein 


1.5 


266 


U14103


Mus musculus RuL 
protein mRNA, 
complete cds. 


0.60 


4099845 


(U90533) serine protease 
inhibitor [Streptomyces fradiae]


0.098 


267 


U95094 


Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds 


0.59 


3282851 


(AF047897) ankyrin-like protein
HGE-ANK [Ehrlichia sp. BDS] 


5.5 


268 


AE000872 


Methanobacterium 
thermoautotrophicum 
from bases 896604 to 
912784 (section 78 of 
148) of the complete 
genome 


0.59 


401553 


HYPOTHETICAL 24.5 KD 
PROTEIN IN NADB-SRMB 
INTERGENIC REGION 


4.3 


nypuu mirorpruiKin - numan 




269 


L11871 


Gallus gallus achaete-
scute homologue
(ASH) mRNA, 
complete cds. 


0.59 


628110 


[Human herpesvirus 4] 2
[Human herpesvirus 4]
>gi|1334838|gnl|PID|e25079 4
[Human herpesvirus 4]
>gi|1334840|gnl|PID|e25081 6
[Human herpesvirus 4]
>gi|1334842|gnl|PID|e25067 8
[Human herpesvirus 4]
>gi|1334844|gnl|PID|e25069 10
[Human herpesvirus 4]
>gi|1334846|gnl|PID|e25071 12
[Human herpesvirus 4]


4.2 


270 


AKM7114 


Oryctolagus 
cuniculus glycogen 
synthase mRNA,
complete cds


0.59


728856 


NITROGENASE IRON-IRON 
PROTEIN ALPHA CHAIN 
(NITROGENASE 
COMPONENT I) 
(DINITROGENASE) capsulatus 
>gi|312238 (X70033) 
alternative nitrogenase


2.4 


271 


AF027807 


Homo sapiens beta- 
casein (CSN2) gene, 
complete cds 


0.59


3252932 


(AF067155) truncated rev 
protein [Human 
immunodeficiency virus type 1] 


1.5 


272 


U81787 


Human Wnt10B
mRNA, complete cds 


0.59 


3875538 


(Z67990) similar to cuticle 
collagen


1.4 


273 


U76036 


Apteryx australis 16S
ribosomal RNA gene,
mitochondrial gene 
for mitochondrial 
RNA, partial 
sequence 


0.59 


4193356 


(AF055088) ATP-binding 
cassette; PsaB [Streptococcus 
pneumoniae] 


0.83 


274 


AB014564


Homo sapiens mRNA
for KIAA0664
protein, partial cds 


0.59 


1709851 


PTB-ASSOCIATED SPLICING
FACTOR (PSF) long form -
human >gi|38458 (X70944)
PTB-associated splicing factor
[Homo sapiens]


0.17 


275 


AF044171 


Homo sapiens cyclin- 
dependent kinase 
inhibitor 2D 
(CDKN2D) gene, 
partial cds 


0.59 


3925213 


(AL032626) Y37D8A.17 
[Caenorhabditis elegans]


3e-10 


276 


L19640


Saccharomyces
cerevisiae cdc2/cdc28
related protein kinase 
gene, complete cds. 


0.59 


3880115 


(Z81130) T23G11.9
[Caenorhabditis elegans]


1e-21


277 


Z80999 


Human DNA 
sequence from 
cosmid E140G5 on 
chromosome 22, 
complete sequence 
[Homo sapiens] 


0.58 


<NONE> 


<NONE> 


<NONE> 


278 


Y11108


H.sapiens WNT8B
gene 


0.58 


<NONE> 


<NONE> 


<NONE> 


279 


U80001


Sphyraena idiastes
lactate dehydrogenase
A


0.58 


<NONE>


<NONE> 


<NONE> 


280 


Z49637


S.cerevisiae
chromosome X 
reading frame ORF 
YJR137c 


0.58 


<NONE> 


<NONE> 


<NONE>


281 


X64467 


H.sapiens ALAD
gene for
porphobilinogen
synthase


0.58 


<NONE> 


<NONE> 


<NONE>


282 




G.gallus hox B3
mRNA


0.58








283 


U68040 


Cochliobolus 
heterostrophus 
polyketide synthase 


0.58 


<NONE> 


<NONE> 


<NONE>


284 


AF089084 


Arabidopsis thaliana
putative auxin efflux
carrier protein (PIN1)
mRNA, complete cds


0.58


<NONE> 


<NONE> 


<NONE> 


285 


U38481 


Rattus norvegicus
ROK-alpha mRNA, 
complete cds 


0.58 


<NONE> 


<NONE> 


<NONE>


286 


AF017656


Homo sapiens G
protein beta 5 subunit
mRNA, complete cds


0.58 


3236249 


(AC004684) hypothetical 
protein [Arabidopsis thaliana] 


9.2


287 


M96234 


Human glutathione 
transferase class mu 
number 4.


0.58 


1280073 


(U55366) Similar to cuticle 
collagen [Caenorhabditis 
elegans] 


7.1


288 


AB002339 


Human mRNA for 
KIAA0341 gene,
partial cds


0.58 


861293 


(U28741) F35D2.1 gene
product [Caenorhabditis 
elegans] 


7.1
Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



289 U11295



290 D80001



291 



Z11700



292 M77350



293 X63787 



Neisseria 
gonorrhoeae 
carbamoyl phosphate 
synthetase 
(glutamine) small 
subunit (carA) and 
large subunit (carB) 

genes, complete cds.
Human mRNA for
KIAA0179 gene,
partial cds



Escherichia coli 
genes faeG, faeH,
faeI, faeJ and IS629-
like insertion
sequence. > ::
emb|Z11710|ECFAE
H1J E.coli faeH, faeI
and faeJ genes
encoding FaeH, FaeI
and FaeJ proteins
Mouse hair keratin
A1 (MHKA1) gene,
complete cds.



294 D63881 



T.thermophila gene 
for snRNA U3-2 



Human mRNA for 
KIAA0160 gene,
partial cds



295 U39378



296 X87987



Gymnocarena
mexicana 16S
ribosomal RNA gene,
mitochondrial gene
encoding
mitochondrial RNA,
partial sequence

P.pastoris PRC1 gene 

> :: 

dbj|E12103|E12103
DNA encoding
precursor of protease
from Pichia pastoris



ACCESSION 



DESCRIPTION 



P VALUE 



0.58 



0.58 



2425135 



4097223 



(AF020283) DG2044 gene
product [Dictyostelium
discoideum]

(U49836) gamma-glutamyl
transpeptidase precursor [Brugia
malayi]



0.58 



0.58 



2347188 



141165 



(AC002338) laccase isolog
[Arabidopsis thaliana] thaliana]

HYPOTHETICAL 8.3 KD
PROTEIN >gi|62179



0.58 



(AB004461) DNA polymerase 
alpha catalytic subunit [Oryza 
2826900 sativa]



0.58 



1934730 



(U95036) germin-like protein 
[Arabidopsis thaliana]



0.58 



2194131 



0.58 



3914197 



(AC002062) Similar to 
Synechocystis antiviral protein



OCCLUDIN >gi|1276983
(U49221) occludin [Canis
familiaris]
>gi|1589181|prf||2210347D
occludin [Canis familiaris]



4.1 



3.1 



3.1 



3.1 


297


X' 7 5782 


A.thaliana (L.Heynh.)
chloroplast mRNA
for recombinant Aro-
kinase


0.58


MX) AAA 


(UjoDzyj UKrLA protein 
[Homo sapiens] 


2.4 


298 


M64848 


Mouse platelet- 
derived growth factor 
0 chain musculus 
platelet-derived 
growth factor beta- 
chain (sis) gene, exon 
5. 


0.58 


3025832


(AF055985) pyrrolidone-rich
antigen [Onchocerca volvulus] 


1.4 




AE001460 


Helicobacter pylori, 
strain J99 section 21 
of 132 of the 
complete genome 


0.58


oott i no 


(AF037454) ubiquitin protein 
ligase [Mus musculus]


11 


300


X65720 


M.musculus gene for 
protein kinase C- 
gamma (exonl and 
exon 2) 


0.58


A 1 Q"1Q< 

418 jyj 


CHD1 PROTEIN
>gi|320737|pir||S30818
hypothetical protein YER164w -
yeast (Saccharomyces
cerevisiae) >gi|603404
(U18917) Chd1p: transcriptional
regulator [Saccharomyces
cerevisiae]


1.1 


301 


AF043130 


Arabidopsis thaliana
lactate dehydrogenase 


0.58 


3024637 


SEX-DETERMINING
REGION Y PROTEIN 
determining protein [Mus 


0.62




D28116 


Human genes for 
collagen type IV 
alpha 5 and 6, exon 1 
and exon V 


0.38 


1458250 


(U64835) T09D3.3 
[Caenorhabditis elegans] 


0.36 




AE001075 


Archaeoglobus 
fulgidus section 32 of 
172 of the complete
genome 


0.58


OTTO'S "3 


(£7/99 1) hypothetical protein
Rv0336 


0.36 


304 


AF003948 


Rhodococcus opacus 
chloromuconate 
cycloisomerase 
transposase homolog 
genes, complete cds 


0.58 


477072 


mucin 7 precursor, salivary - 
human 


0.28 


305 


U10692


Human MAGE-7 
antigen (MAGE7) 
pseudogene, complete 
cds. 


0.58


3287858 


HOMEOBOX PROTEIN HOX-
C11


0.054 


306 


AF003948 


Rhodococcus opacus 
chloromuconate 
cycloisomerase 
transposase homolog 
genes, complete cds 


0.58 


3551821 


(AF058803) mucin 4 [Homo 
sapiens] 


0.041 


307 


X99350 


H.sapiens HFH4 
gene, exon 1 and 
joined CDS 


0.58 


137483 


VAV PROTO-ONCOGENE
>gi|55221 (X64361) proto-
oncogene [Mus musculus]


0.024 


308 


AJ234282 


Homo sapiens mRNA 
for Ig heavy chain 
variable region, clone 
C 


0.58 


3264846 


(AC003682) R27945.2 [Homo 
sapiens] 


0.018 


309 


AF079310 


Mus musculus histone
deacetylase 3 
(Hdac3) gene, exons 
4 through 15 and 
complete cds 


0.58 


1657601 


(U66220) unknown 
[Nannocystis exedens] 


0.014 


310 


AF019367 


Human thiopurine 
methyl transferase 
(TPMT) gene, exons 
6 and 7 


0.58 


3283352 


(AF063020) lens epithelium- 
derived growth factor [Homo 
sapiens] 


0.011 


311 


X65720 


M. musculus gene for 
protein kinase C- 
gamma (exonl and 
exon 2) 


0.58 


1790878 


(U38291) microtubule-
associated protein 1a [Homo
sapiens] 


0.008 


312 


AB011155 


Homo sapiens mRNA 
for KIAA0583 
protein, partial cds 


0.58 


1351166 


SYNAPSINS IA AND IB 
>gi|163713 


0.006 


313 


X63692 


H.sapiens mRNA for 
DNA 


0.58 


1817548 


(D84307) phosphoethanolamine 
cytidylyl transferase [Homo 
sapiens] 


0.001 


314 


U53746 


Feline
immunodeficiency
virus isolate FIV-
Pco336-8 pol
polyprotein (pol)
gene, partial cds


0.58 


2246532 


(U93872) ORF 73, contains 
large complex repeat CR 73 


2e-05 


315 


K00436 


Rattus norvegicus 
(clone rtl-1) pseudo-
Gly-tRNA gene.


0.58 


206712 


(M64793) salivary proline-rich 
protein [Rattus norvegicus] 


1e-05



316 


S79632 


HSF2=heat shock
factor 2 {alternatively
spliced, splice
junction region}
[mice, CBA/J, testis,
Genomic, 120 nt,
segment 2 of 3]


0.58 


4038594 


(AJ222798) tDET1 protein
[Lycopersicon esculentum]


3e-06 


317 


D43964 


Rat liver mRNA for 
Kan-1, complete cds


0.58 


1280135 


coded for by C.
elegans cDNA cm21e6; coded
for by C. elegans cDNA
cm01e2; similar to melibiose
carrier protein
(thiomethylgalactoside permease
II)


1e-08


318 


AB007918 


Homo sapiens mRNA 
for KIAA0449 
protein, partial cds 


0.58 


2833239 


EPIDERMAL GROWTH 
FACTOR RECEPTOR 
KINASE SUBSTRATE EPS8
>gi|530823 (U12535) epidermal 
growth factor receptor kinase 
substrate [Homo sapiens] 


3e-13 


319 


AB001466 


Homo sapiens mRNA 
for Efs1, complete
cds 


0.58 


2943716 


(D45027) 25 kDa trypsin 
inhibitor [Homo sapiens] 


2e-14 


320 


Z11701 


Saccharomyces 
cerevisiae ERE1 gene 
for putative protein 
kinase. 


0.58 


3880115


(Z81130) T23G11.9 
[Caenorhabditis elegans] 


9e-21 


321 


Z49535 


S.cerevisiae 
chromosome X 
reading frame ORF 
YJR035w 


0.58 


4106562 


(Z83819) dJ146H21.2 (similar 
to CYTOCHROME B-245 
HEAVY CHAIN) [Homo 
sapiens] 


3e-33 


322 


M62506 


S.cerevisiae DBF20
gene, complete cds. 


0.57 


<NONE> 


<NONE> 


<NONE> 


323 


X05944 


Yeast PSS gene for 
phosphatidylserine
synthetase 


0.57 


<NONE> 


<NONE> 


<NONE> 


324 


D38536 


Snail gene for ADP-
ribosyl cyclase,
complete cds 


0.57 


<NONE> 


<NONE> 


<NONE> 


325 


Z75004


S.cerevisiae 
chromosome XV 
reading frame ORF 
YOR096w 


0.57 


<NONE> 


<NONE> 


<NONE> 


326 


L77034 


Homo sapiens
(subclone 10_e10
from P1 H16) DNA
sequence. 


0.57 


<NONE> 


<NONE> 


<NONE> 



Cyprinus carpio c- 










327 


D37887 


myc gene for c-Myc, 
complete cds 


0.57 


<NONE> 


<NONE> 


<NONE> 


328 


AB014562


Homo sapiens mRNA 
for KIAA0662 
protein, partial cds 


0.57 


197406 


(M57576) Ig kappa chain [Mus
musculus] 


8.9 


329 


Z69651


Human DNA
sequence from
cosmid L75B9,
Huntington's Disease
Region, chromosome
4p16.3


0.57 


1079280 


chaperonin containing TCP-1
complex gamma chain - African 
clawed frog >gi|793886 
(X84990) Ccte 


8.9 


330 


D89285 


Mesocricetus auratus 
mRNA for inter-alpha 
trypsin inhibitor 
heavy chain I, 
complete cds 


0.57 


134132 


RYANODINE RECEPTOR, 
SKELETAL MUSCLE 


6.9 


331 


Z48951 


S.cerevisiae
chromosome XVI 
cosmid 9723 


0.57 


4210432 


(AJ130783) APC2 protein [Mus
musculus] 


5.3 


332 


X95573 


A.thaliana mRNA for 
salt-tolerance zinc
finger protein 


0.57 


1174828 


TYROSINE 
DECARBOXYLASE 2 
4.1.1.25) - parsley >gi|169671
(M96070) tyrosine 
decarboxylase [Petroselinum 




333 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.57 


465646


PROBABLE ABC
TRANSPORTER ATP-
BINDING PROTEIN IN
NTRA/RPON 5'REGION
(ORF1) Azorhizobium
caulinodans >gi|311388
(X69959) ORF1 


4.0 


334 


AE001116 


Borrelia burgdorferi
(section 2 of 70) of 
the complete genome 


0.57


2314735 


(AE000653) Na+/H+ antiporter 
(nhaA) [Helicobacter pylori
26695]


4.0 


335 


Z34291


R.norvegicus mRNA
for putative chloride 
channel. 


0.57 


1350832 


DNA-DIRECTED RNA
POLYMERASE I SECOND
LARGEST SUBUNIT (RNA
POLYMERASE I SUBUNIT 2)
chain RPA2 - Euplotes
octocarinatus (SGC9)
>gi|578407 octocarinatus]


3.0 


(Z81063) similar to Actinin-type




336 


D88255 


Homo sapiens A30 
Vk germ line gene, 
partial cds 


0.57 


3875983 


actin-binding domain containing 
proteins [Caenorhabditis 
elegans]


3.0 


337 


AF037261 


Homo sapiens SH3- 
containing adaptor 
molecule-1 mRNA,
complete cds 


0.57 


1397341 


(U6iy33) similar to kinesin-like
protein; coded for by C. elegans
cDNA yk184h5.3; coded for by
C. elegans cDNA yk184h5.5;
coded for by C. elegans cDNA
yk13d7.3; coded for by C.
elegans cDNA yk13d7.5; coded
for by C. elegans cDNA
yk31e1.5;co~. >gi|3493541
(AF057567) kinesin-like protein
ZEN-4a [Caenorhabditis
elegans]


2.3 


338 


U26595 


Rattus norvegicus 
prostaglandin F2a 
receptor regulatory 
protein precursor
mRNA, complete cds


0.57 


2773160 


(AF039656) neuronal tissue-
enriched acidic protein [Homo
sapiens]


2.3 


339 


X69903 


R.norvegicus mRNA
for interleukin 4 


0.57


2649193 


(AE001009) quinone-reactive 
Ni/Fe-hydrogenase B-type 
cytochrome subunit (hydC) 

[Archaeoglobus fulgidus]


1.8


340 


Z74825 


S.cerevisiae 
chromosome XV 
reading frame ORF
YOL083w 


0.57 


1458319 


(U64846) F47D2.5 gene
product [Caenorhabditis
elegans]


1.4 


341 


AJ131469 


Foot-and-mouth 
disease virus O vpl 
gene, strain O/A/58 


0.57


91206 


proline-rich protein - mouse 
(fragment) musculus] 


1.4 


342 


AF011360


Mus musculus
regulator of G-protein
signaling 7 (RGS7)
mRNA, complete cds


0.57 


542514 


gelsolin - American lobster


0.80 


343 


AF011360


Mus musculus
regulator of G-protein
signaling 7 (RGS7)
mRNA, complete cds


0.57 


1078946


gelsolin - American lobster
>gi|452313 gelsolin [Homarus
americanus]


0.80 



344 


L39210 


Homo sapiens inosine 
monophosphate 
dehydrogenase type II 
gene, complete cds 


0.57 


559526 


(X77466) 98.8kD polyprotein 
[Strawberry latent ringspot 
virus] 


0.79 


345 


U81523 


Human endometrial 
bleeding associated 
factor mRNA, 
complete cds 


0.57 


211499


(K01702) HMW/LMW collagen
subunit precursor [Gallus gallus]


0.79 


346 


U46561 


Tetrahymena 
thermophila 
polyubiquitin (TTU3) 
gene, complete cds, 
and RNA polymerase 
II subunit 2 (RPB2) 
gene, partial cds 


0.57 


2506493 


HYPOTHETICAL 100.5 KD 
PROTEIN IN IAP-CYSH 
INTERGENIC REGION 
>gi|882654 (U29579) alternate
gene name ygcB; ORF_f888
[Escherichia coli] >gi|1789119


0.60 


347 


X95543 


C.japonica mRNA for 
legumin (clone 
CjLeg31) 


0.57 


1709261 


NEUROFILAMENT TRIPLET
M PROTEIN (160 KD
NEUROFILAMENT
PROTEIN) (NF-M)
>gi|1083164|pir||S55395
neurofilament protein M - rabbit
(fragment) >gi|854353


0.46 


348 


Y17282


Homo sapiens mRNA 
for cytokeratin type II 


0.57 


3044086 


(AF055904) unknown 
[Myxococcus xanthus]


0.45 


349 


X00716 


Frog mRNA fragment 
for alpha-A2-
crystallin


0.57 


3406654 


(AF079369) transcriptional 
repressor TUP1 [Dictyostelium
discoideum] 


0.20 


350 


X53238 


Klebsiella sp. 
bacteriophage K11
gene 1 for RNA 
polymerase 


0.57 


1228093 


(Z46913) polyketide synthase


0.16 


351 


X99012 


H. sapiens FUS gene, 
exon 12 


0.57 


243898


(S78S97) GOR=antigenic 
epitope [chimpanzees. Peptide, 
427 aa] [Pan] 


0.090 


352 


AL008711


Human DNA 
sequence from PAC 
390N22 on 
chromosome Xp22.2 


0.57 


1469545 


(U53585) fibronectin attachment 
protein [Mycobacterium avium]


0.053 


353 


S74506 


SOX9 [human, fetal
brain, Genomic, 1494
nt, segment 3 of 3]


0.57 


1326350 


(U58748) similar to potential
transmembrane domains in S.
cerevisiae nuclear division
RFT1 protein (SP:P38206)


0.017 


354 


D25542 


Human mRNA for 
golgi antigen gcp372, 
complete cds 


0.57 


4063399 


(AF102575) cell surface protein
DTFA [Dictyostelium
discoideum] 


0.005 


355 


AB015426 


Mus musculus mRNA
for alpha 1,3-
fucosyltransferase IX,
complete cds


0.57 


2661842 


(Y15732) DNA polymerase beta
[Xenopus laevis]


7e-11


356 


X51394


Xenopus mRNA for 
APEG protein, 
containing a highly 
repetitive amino acid 
sequence 


0.57 


1929056 


(Y12090) putative 3,4-
dihydroxy-2-butanone kinase
[Lycopersicon esculentum]




9e-12 


357 


AB007918 


Homo sapiens mRNA 
for KIAA0449 
protein, partial cds 


0.57 


2833239 


EPIDERMAL GROWTH
FACTOR RECEPTOR
KINASE SUBSTRATE EPS8
>gi|530823 (U12535) epidermal
growth factor receptor kinase
substrate [Homo sapiens]


3e-13


358 


AB001466 


Homo sapiens mRNA 
for Efs1, complete
cds


0.57


2943716 


(D45027) 25 kDa trypsin 
inhibitor [Homo sapiens] 


2e-14


359 


Y00760 


Rabbit mRNA for 
adult fast skeletal 
troponin-C


0.57 


2576348 


(AC002400) Glutamyl tRNA 
synthetase [Homo sapiens] 


2e-28 


360 


X95153 


H.sapiens brca2 gene 
exon 3 > 

emb|A62778|A62778 
Sequence 19 from 
Patent WO9719110


0.57 


3419847 


(AC004982) similar to yeast
hypothetical protein ybk4; 
similar to P38164 
(PID:g586461) [Homo sapiens] 


2e-55 


361 


X85967 


B. vulgaris mRNA for 
betavulgin 


0.56 


<NONE> 


<NONE> 


<NONE> 


362 


U09251


Mycoplasma 
genitalium DNA 
gyrase subunit B 
complete cds, DNA 
polymerase III beta 
subunit (dnaN) and 
seryl-tRNA
synthetase (serS) 
genes, partial cds. 


0.56 


<NONE> 


<NONE> 


<NONE> 


363 


V00158 


Chloroplast Euglena 
gracilis genes coding 
for transfer RNAs 
specific for threonine, 
glycine, methionine, 
serine and glutamine.


0.56 


<NONE> 


<NONE> 


<NONE> 


Clostridium 










364 


D88151 


perfringens DNA for 

XJ - olu n 1 1 1C . \J " «li »»1 1 1 * « v 

ligase, cortical 
fragment-lytic
enzyme 


0.56 


<NONE> 


<NONE> 


<NONE> 


365 


U67478 


Methanococcus
jannaschii section 20
of 150 of the 
complete genome 


0.56 


<NONE> 


<NONE> 


<NONE> 


366


L23800 


Tachyglossus 
aculeatus beta-globin 
homolog (HBB) 
gene, complete cds 


0.56 


<NONE> 


<NONE> 


<NONE> 


367 


AB011129


Homo sapiens mRNA
for KIAA0557
protein, partial cds 


0.56 


<NONE> 


<NONE> 


<NONE> 




L77034 


Homo sapiens 
(subclone 10_e10
from P1 H16) DNA
sequence. 


0.56 


<NONE> 


<NONE> 


<NONE> 


369 


Z47202 


C.albicans gene for 
TFIIIB (BRF1) 
subunit. 


0.56 


<NONE> 


<NONE> 


<NONE> 


370 


U53868 


Clostridium 
acetobutylicum
mannitol-specific 
phosphotransferase 
system (PTS) system, 
mtlA, mtlR. mtlF, and 
mtlD genes, complete 
cds 




<NONE>






371 


AF041259 


Homo sapiens breast 
cancer putative 
transcription factor 
(ZABC1) mRNA,
complete cds 


0.56 


<NONE> 


<NONE> 


<NONE> 


372 


L42636 


Plasmodium 
falciparum variant- 
specific surface 
protein (var-7) 
mRNA, complete cds.


0.56 


2213557 


(Z97052) hypothetical protein 


8.8 


373 


U96180 


Human protein 
tyrosine phosphatase 
(TEP1) mRNA,
complete cds 


0.56 


731016 


THIOREDOXIN REDUCTASE 
thioredoxin reductase (NADPH) 
[Coxiella burnetii] 


8.7 


374 


L76259 


Homo sapiens PTS 
gene, complete cds 


0.56 


2369863 


(Y12225) Spi-1/PU.1
transcription factor 


6.7 


375 


AF045946 


Mus musculus
D16Jhul7 YAC 
98B3 acentric end, 
partial sequence 


0.56 


2130017 


hypothetical protein - common 
sunflower protein [Helianthus 
annuus] 


5.1 


376 


X97986 


M.musculus mRNA
for desmocollin type 
I 


0.56 


4038031 


(AC005936) hypothetical 
protein [Arabidopsis thaliana]


3.9 


377 


X79437 


M.musculus whey 
acidic protein (WAP) 
gene, exon 1 


0.56 


549670 


SPINDLE POLE BODY
COMPONENT SPC42 - yeast
(Saccharomyces cerevisiae)
>gi|486054 (Z28042) ORF
YKL042w [Saccharomyces
cerevisiae] >gi|666090
(X71621) hypothetical 42.3 kD
protein [Saccharomyces
cerevisiae]


3.9 


378 


M27902 


Rat cardiac specific 
sodium channel alpha- 
subunit mRNA, 
complete cds. 


0.56 


585234 


ENDOGLUCANASE G 
PRECURSOR 3.2.1.-) CelCCG
precursor - Clostridium 
cellulolyticum cellulolyticum] 


3.9 


379 


AF036696 


Caenorhabditis 
elegans cosmid 
F15B10 


0.56 


546071 


gp70=envelope protein
{endogenous provirus} host=cat
lymphoid tissues, Peptide, 445
aa]


3.6 


380 


299J02 


Caenorhabditis 
elegans cosmid 
B0331, complete
sequence
[Caenorhabditis
elegans]


0.56 


603664 


( t ) 1 4 1 (>] ) putative reverse
transcriptase; ORF2; encodes aa
motifs conserved in reverse
transcriptases; most closely
related reverse transcriptases are
those of non-LTR
retrotransposons. The 3' 901 bp
of this CDS are identical to the
3' 901 bp ...


3.0 


381 


L27850 


Equus caballus (clone 
T131)T-cell receptor 
}NA. V-resion. I 


0.56 


1079150 


transcription factor shn - fruit fly 


1.7 


HYPOTHETICAL 113.1 KD



382 


X97986 


M.musculus mRNA
for desmocollin type
I


0.56


2497227 


PROTEIN IN PRE5-FET4 
INTERGENIC REGION
>gi|1072409 (Z54141) unknown


1.7 


383 


AF087455 


Didelphis virginiana 
G protein receptor 
kinase 2 mRNA, 
complete cds 


0.56




(U12964) contains ankyrin-like
repeats; similar to human
desmoplakin repeat region
[Caenorhabditis elegans]


1.3 


384 


D80011 


Human mRNA for 
KIAA0189 gene, 
complete cds 




IZQjJJ 




U 


385 


AJ002272 


Mus musculus mRNA 
for HAP1-A protein, 
3' region 


0.56 


3327158 


(AB014572) KIAA0672 protein
[Homo sapiens]


1.0 


386 


L39210 


Homo sapiens inosine 
monophosphate 
dehydrogenase type II 
gene, complete cds 


0.56




coat protein - strawberry latent 
ring spot virus 


0.77 


387 


X02770 


Mouse Thy- 1.2 gene 
5' untranslated region 
and exon 1 


0.56 


3327046 


(AB014516) KIAA0616 protein
[Homo sapiens] 


0.59 


388 


AF038575 


Schizosaccharomyces 
pombe Wiskott- 
Aldrich Syndrome 
protein homolog
(wspl+) gene, 
complete cds, and 
BTF3/beta-NAC 
gene, partial sequence 


0.56 


88466 


salivary proline-rich
phosphoprotein precursor PRH1
(allele PDF) - human >gi|190484
(K03203) prepro salivary 
proline-rich protein [Homo 
sapiens] >gi|190512 


0.35 


389 


ADO /4 / 


Rat mRNA for fetal 
intestinal lactase- 
phlorizin hydrolase 
precursor, partial


0.56


2072742 


(Z48674) chitinase homologue
[Sesbania rostrata]


0.23 


390 


Y12072


G.arboreum mRNA
for farnesyl 
pyrophosphate 
synthase 


0.56 


296670 


(X07S82) Po protein [Homo 
sapiens] 


0.20 


391 


S75756 


p15=cyclin D-
dependent kinases 4
and 6-binding
protein/p15 product
{exon/intron 1}
[human, brain tumors,
Genomic, 753 nt]


0.56 


1082743 


protein kinase (EC 2.7.1.37) 
SPRK - human sapiens]
>gi|1090771|prf||2019437A
protein Tyr kinase I


0.15 


Equus caballus type










392 


U62528 


II collagen mRNA, 
complete cds 


0.56 


461671 


[Segment 1 of 2] COLLAGEN
ALPHA 1(II) CHAIN


0.030 


393 


X96877 


C.reinhardtii mRNA
for unknown lumenal 
polypeptide 


0.56 


3341678 


(AC003672) putative zinc finger 
protein [Arabidopsis thaliana] 


5e-09 


394 


S78788 


cGATA-j [chickens, 
liver. Genomic, 979 
nt, segment 4 of 4] 


0.56 


2661590 


(AL009196) 1-
evidence=predicted by content;
method_score=59.41; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=AA950019; 2-
match_description=LD29959.5p
rime LD Drosophila
melanogas...


2e-11


395 


AF006640 


Drosophila 
melanogaster Ste20- 
like protein kinase 
mRNA, complete cds 


0.56 


1109830


(U41534) coded for by C.
elegans cDNA CEESI42F;
Similar to helicases of
SNF2/RAD54 family.
[Caenorhabditis elegans]


6e-12 


396 


AF006640 


Drosophila 
melanogaster Ste20- 
like protein kinase 
mRNA, complete cds 


0.56 


1109830


(U41534) coded for by C.
elegans cDNA CEESI42F;
Similar to helicases of
SNF2/RAD54 family.
[Caenorhabditis elegans]


4e-13




AE000716 


Aquifex aeolicus 
section 48 of 109 of
the complete genome 


0.56 


3688350 


(.AJ_0JU9yo) aJilo9Bi4.4 
(novel PUTATIVE protein 
similar to hypothetical proteins 
S. pombe C22F3.14C and C. 
elegans C l o Ai.o) [Homo 
sapiens] 


3e-66 


398 


Z36079 


S.cerevisiae 
chromosome II 
reading frame ORF 
YBR210w 


0.55 


<NONE> 


<NONE> 


<NONE> 


399 


Y17267


Mus musculus mRNA 
for ubiquitin 
conjugating enzyme 


0.55 


<NONE> 


<NONE> 


<NONE> 


400 


AC001461 


Homo sapiens 
(subclone 2_g5 from 
BAC H107) DNA 
sequence 


0.55 


<NONE> 


<NONE> 


<NONE> 


Alouatta seniculus










401 


AF019079 


breast and ovarian
susceptibility
(BRCA1) gene,
partial cds 


0.55 


<NONE> 


<NONE> 


<NONE> 


402 


M90058 


Human serglycin
gene, exons 1, 2, and
3. 


0.55 


<NONE> 


<NONE> 


<NONE> 


403 


AB013469


Mus musculus CLM2 
gene for cytohesin 2, 
complete and partial 
cds, alternative 
splicing 


0.55 


1729760 


(Z68152) chitinase [Gossypium 
hirsutum] 


8.6




AJ011592 


Bacteriophage P1 ban
gene 


0.55 


2493689 


PHOTOSYSTEM II 10 KD
PHOSPHOPROTEIN deltoides]
>gi|2143326|gnl|PID|e319090
(Y13328) 10kDa
phosphoprotein [Populus
deltoides] 


6.6 


405 


Z15118 


T.brucei kinetoplast 
maxicircle variable 
region DNA 


0.55 


2970432 


(AF049132) NADH 
dehydrogenase subunit 5 
[Florometra serratissima]


6.5 


406 


Z48951


S.cerevisiae 
chromosome XVI 
cosmid 9723 


0.55 


4210432 


(AJ130783) APC2 protein [Mus
musculus] 


4.9 


407 


U78726 


Homo sapiens mad 
protein homolog 
Smad2 gene, 
promoter, exon 1a
and exon 1b


0.55 


3319290 


(AF055994) thyroid hormone 
receptor-associated protein 
complex component TRAP220 
[Homo sapiens] 


4.9 


408 


AG001389 


Homo sapiens
genomic DNA, 21q 
region, clone; 
9HllBm42 


0.55 


125684 


KRUEPPEL PROTEIN 
>gi|72899|pir||TWFF Krueppel 
gap protein - fruit fly
(Drosophila sp.) melanogaster]
>gi|224875|prf||1202348A
Krueppel gene


3.8 


409 


M27640 


Plasmodium vivax 
major blood stage 
surface antigen gene, 
partial cds. 


0.55 


549453 


X-LINKED PEST-CONTAINING
TRANSPORTER transporter -
human >gi|458255 (U05321) X- 
linked PEST-containing 
transporter [Homo sapiens] 


3.8 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE






Fugu rubripes mRNA 










410 


D37977 


for sodium channel 
alpha subunit, partial
cds 


0.55 


1435038 


(D38024) ORF [Homo sapiens] 


3.7 


411 


M88505


Ostertagia ostertagi 
cathepsin B-like 
cysteine protease 
gene, partial cds. 


0.55 


3941277 


(AF000900) p45 [Rattus 
norvegicus]


2.9 


412 


U95098 


Xenopus laevis
mitotic 

phosphoprotein 44 
mRNA, partial cds 


0.55 


2570154 


(AB008376) 17-kDa PKC- 
potentiated inhibitory protein of 
PP1 [Sus scrofa] 


2.8 


413 


U89241 


Human mibp gene, 
partial cds


0.55 


4097465 


(U62253) 16kDa secretory 
protein [Sus scrofa] 


2.2 


414 


AF027151 


Xenopus laevis 
survival of motor 
neuron protein 
interacting protein 1 
(SIP1) mRNA,
complete cds 


0.55 


4007790 


(AL034463) putative single-
strand polynucleotide binding
protein [Schizosaccharomyces
pombe] 


1.7 


415 


AF006821 


Bufo marinus
natriuretic peptide
receptor C mRNA, 
partial cds 


0.55 


2245075 


(Z97343) GTP-binding RAB2A 
protein 


1.7 


416 


Y12736


Lactococcus lactis 
cremoris plasmid
pIW565 DNA. 
UabiiiM, UabiiR genes 
and orfX 


0.55 


3386334 


(AF035120) type I procollagen
pro-alpha 2 chain [Canis 
familiaris] 


1.3 


417 


U38307 


Mus musculus
collagen alpha-1 type
1 gene, 5' flanking
region, partial 
sequence. 


0.55 


1362802 


gastric mucin - human 
(fragment) >gi|547517


1.3 


418


D13473


Mouse mRNA for 
Rad51 protein 


0.55 


1374698 


(D83032) nuclear protein,
NP220 [Homo sapiens] 


1.3 


419 


AF045238 


Bungarus fasciatus 
acetylcholinesterase 
gene, alternatively 
spliced products, 
partial cds


0.55 


3261734 


(Z94752) hypothetical protein 
Rv1004c


0.99 


420


AE000795 


Methanobacterium
thermoautotrophicum
from bases 1 to
1020S (section 1 of
148) of the complete
genome


0.55 


186396 


(M94131) mucin [Homo
sapiens] 


0.97 






Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Y.lipolytica SEC62






(Z81068) F25H5.2




421 


X99537 


gene 


0.55 


3876397 


[Caenorhabditis elegans] 


0.58 




422


U08147


Aquilegia sp. 
phytochrome 
(PHYB/D) gene, 
partial cds. 


0.55 


2338024 


(Ar00537u) ribonucleotide
reductase, large subunit


0.57 


423 


Z56586 


H.sapiens CpG DNA, 
clone 12c8, reverse 
read cpgl2c8.rtld . 


0.55 


3320122 


(U46007) espin [Rattus
norvegicus]


0.44 


424 


U39442 


Mus musculus 
glutamine:fructose-6-
phosphate
amidotransferase
(GFAT) gene, 5'
region and partial cds 


0.55 


282600 


hypothetical protein - 
Mycoplasma hyorhinis 


0.43 


425 


K02298 


Rat chymotrypsin B
(chyB) gene, 
complete cds. 


0.55 


3413810 


(Y17034) Bassoon [Mus
musculus] 


0.33 


426 


X84792 


M.musculus clusterin
gene 


0.55 


1652475 


(D90905) hypothetical protein 


0.25 


427 


U00185 


Capra aegagrus 
Saanen and Weisse 
Edel breeds DR beta- 
chain antigen binding 
domain, MHC class II 
DRB 


0.55 


2507136 


SUBTILIN BIOSYNTHESIS 
PROTEIN SPAB 


0.19 


428 


Zd4946 


H. sapiens CpG DNA, 
clone 178a12, reverse
read cpg178a12.rt1a.


0.55 


80/646 


(M 17294) unknown protein 
[Human herpes virus 4]


fi HAS 


429 


AF031650 


Oryctolagus 
cuniculus anion 
exchanger 3 brain 
isoform (AE3),
mRNA, complete cds


0.55 


1778210 


(U68412) fibrillar collagen 
[Arenicola marina]


0.044 


430 


M25579 


Bovine adenylyl 
cyclase Type I 
mRNA, complete cds.


0.55 


2649040 


(AE000997) conserved
hypothetical protein
[Archaeoglobus fulgidus]


0.023 


431 


Z48796 


H.sapiens Ski-W 
mRNA for helicase 


0.55 


3304521


(M14708) DNA polymerase
[Human cytomegalovirus] 


0.023 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


432


M80234


Cow dopamine
transporter mRNA,
putative cds.


0.55


3874972


(Z99709) similar to Elongation
factor Tu family (contains
ATP/GTP binding P-loop);
cDNA EST EMBL:D76223
comes from this gene; cDNA
EST yk478c5.5 comes from this
gene [Caenorhabditis elegans]


4e-04


433


U91616


Human I kappa B
epsilon (IkBe)
mRNA, complete cds


0.55


3875577


(Z68314) similar to G-protein;
cDNA EST EMBL:C11959
comes from this gene; cDNA
EST EMBL:C10341 comes
from this gene; cDNA EST
yk494e4.3 comes from this
gene; cDNA EST yk448a8.5
comes from this gene comes
from this gene; cDNA EST
EMBL:C10341 comes from this
gene; cDNA EST yk494e4.3
comes from this gene; cDNA
EST yk448a8.5 comes from this
gene [Caenorhabditis elegans]
>gi|3880364|gnl|PID|e1349948
(Z83016) similar to G-protein;
cDNA EST EMBL:C11959
comes from this gene; cDNA
EST EMBL:C10341 comes
from this gene; cDNA EST
yk494e4.3 comes from this
gene; cDNA EST yk448a8.5
comes from this gene
[Caenorhabditis elegans]


7e-06


434


D10910


Arabidopsis thaliana
Atpk7 gene for
serine/threonine
protein kinase,
complete cds


0.55


3876072


(Z31505) Similarity to
Methanococcus hypothetical
protein 0682 (TR:Q58095)
[Caenorhabditis elegans]


4e-42


435


L22013


Swinepox virus
complete ORFS
C20L-C1L >::
gb|I58297|I58297
Sequence 14 from
patent US 5651972


0.54


<NONE>


<NONE>


<NONE>


436


Z92653


Human
immunodeficiency
virus type 1 env gene


0.54


<NONE>


<NONE>


<NONE>
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















437 




E.coli phosphate-
repressible
periplasmic
phosphate-binding
protein (phoS),
peripheral membrane
proteins (pstC, pstB
and phoU) and
integral membrane
protein (pstA) genes,
complete cds.


0.54


<NONE>


<NONE>


<NONE> 


438 


AE001415 


Plasmodium 
falciparum 
chromosome 2, 
section j— oi /j oi 
the complete 
sequence 


0.54


<NONE>


<NONE>


<NONE> 




439


AF064030


Helianthus tuberosus
lectin 2 mRNA,
complete cds 


0.54 


<NONE> 


<NONE> 


<NONE> 


440 


X12591 


E.coli plasmid DNA 
for colicin E9


0.54 


<NONE> 


<NONE> 


<NONE> 


441 


U73679 


Caenorhabditis
elegans YNK1-a
mRNA, complete cds 


0.54 


<NONE> 


<NONE> 


<NONE> 


442 


Z93990 


Unidentified 
bacterium DNA for 
16S ribosomal RNA


0.54 


<NONE> 


<NONE> 


<NONE> 


443 


X8D967 


B. vulgaris mRNA for 
betavulgin


0.54 


757836 


(Z37980) ORF12 [Escherichia
coli] 


8.3


444 


U76524 


Sambucus nigra
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.54 


151377 


(M80653) tetraheme 
[Pseudomonas stutzeri]


6.2 


445 


X71800


H.sapiens gene for 5S
rRNA (640 bp) >::
emb|X71801|HS5SR640B
H.sapiens gene
for 5S rRNA (640 bp)


0.54 


3322653 


(AE001216) T. pallidum
predicted coding region TP0369 


2.7 


446 


U89241 


Human mibp gene,
partial cds 


0.54 


4097465 


(U62253) 16kDa secretory
protein [Sus scrofa]


2.2 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















447 


L16013


Rattus norvegicus Q-
like gene sequence 


0.54


3087760 


(AJ005583) p75 protein 
[Crypthecodinium cohnii] 


0.95 




448


U60275


Capra hircus skeletal 
muscle voltage-gated
chloride channel
gLlL-i mRNA,
partial cds 


0.54 


1781344 


(Y10438) FK506 polyketide
synthase


0.95 


449 


U36795 


Myxococcus xanthus 
rfbABC O-antigen 
biosynthesis operon, 
rfbA, rfbB, and rfbC 
genes, complete cds. 


0.54 


3877232 


(Z81540) predicted using 
Genefinder 


0.74 




450


AF053091


Drosophila 
melanogaster eyelid
(eld) mRNA, 
complete cds 


0.54 


2144110 


zinc finger protein RIZ - rat
>gi|949996


0.14


451 


V00602 


Genome of the 
bacteriophage fd 
(Inoviridae). 


0.54 


2661620 


(AL009197) hypothetical 
protein 


0.11 


452 


U60800


Human semaphorin 
(CD 100) mRNA, 
complete cds 


0.54


IzjoXZ 


KERATIN, ULTRA HIGH-
SULFUR MATRIX PROTEIN
(UHS KERATIN)
>gif lOvl I6|pir||A3oooo ultra-
high-sulfur keratin - sheep
>gi|1306 (X55294) ultra high-
sulphur keratin protein [Ovis
aries]


O.ufJJ 


453 


X85969 


S.coelicolor secD, 
secF & apt genes 


0.54 


3874972 


(Z99709) similar to Elongation
factor Tu family (contains
ATP/GTP binding P-loop);
cDNA EST EMBL:D76223
comes from this gene; cDNA
EST yk478c5.5 comes from this
gene [Caenorhabditis elegans]


7e-06 


454 


Y08265 


H.sapiens mRNA for 
DAN26 protein, 
partial 


0.54 


3875131 


(Z70750) similar to vanadate 
resistance protein 
transmembranous domains 
[Caenorhabditis elegans] 


5e-12 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Hydromantes 










455


U89613 


platycephalus 
cytochrome b (cytb) 
gene, mitochondrial 
gene encoding 
mitochondrial
protein, partial cds 


0.53 


<NONE> 


<NONE> 


<NONE> 


456 


AF034597 


Habrobracon hebetor 
cytochrome oxidase
II gene, partial cds;
and tRNA-Asp, tRNA-
His, and tRNA-Lys
genes, complete 
sequence, 

mitochondrial genes 
for mitochondrial 
products 


0.53 


<NONE> 


<NONE> 


<NONE> 


457 


K026d3 


Yeast (S.cerevisiae) 
tau repetitive element 
and Cys-tRNA. 


0.53 


<NONE> 


<NONE> 


<NONE> 


458 


X53416 


Human mRNA for 
actin-binding protein 
(filamin) 


0.53 


2134839 


bullous pemphigoid antigen 2 - 
human


6.2 


459 


M55545 


Drosophila 
subobscura alchohol 
dehydrogenase (Adh) 
gene, and alchohol 
dehydrogenase (Adh- 
dup) gene, complete 
cds's. 


0.53 


2136865


hair keratin cysteine rich protein
- sheep 


2.1 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















460 


U19362


thermoautotrophicum
methylene-
tetrahydromethanopterin
dehydrogenase
(mtd),
imidazoleglycerol-
phosphate
dehydrogenase
(hisB), and putative 
ferredoxin (fdxA) 
genes, complete cds, 
orf9 gene, partial cds, 
orfs ... 


0.53 


731969 


HYPOTHETICAL 91.6 KD 
PROTEIN IN HXT8-CRT1 
INTERGENIC REGION 
>gi|1078261|pir||S50773 
probable membrane protein 
YJL212c - yeast
(Saccharomyces cerevisiae) 
>gi|496950 (Z34098) ORF 
[Saccharomyces cerevisiae] 
>gi|1015596 (Z49487) ORF 
YJL212c 


0.54 


461 


AB011527


Rattus norvegicus 
mRNA for MEGF1,
complete cds 


0.53 


417037 


GERM CELL-LESS PROTEIN 
fruit fly (Drosophila
melanogaster) >gi|157490
(M97933) germ cell-less protein 
[Drosophila melanogaster] 


3e-06 


462 


U64313 


Bacillus firmus MsyB 
gene 5' upstream
region and partial cds 


0.52 


<NONE> 


<NONE> 


<NONE> 


463 


AF008590 


Caenorhabditis 
elegans paraquat 
responsive protein 
(CePqM132) mRNA, 
complete cds 


0.52 


<NONE> 


<NONE> 


<NONE> 


464 


L10245


Mus saxicola 
spermidine/spermine
N1-acetyltransferase
(SSAT) gene, 
complete cds. 


0.52 


<NONE> 


<NONE> 


<NONE> 


465 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


0.52 


124263 


INSULIN-LIKE GROWTH
FACTOR IB PRECURSOR
(IGF-IB) (SOMATOMEDIN C)
>gi|69361|pir||IGHU1B insulin-
like growth factor IB precursor -
human prepropeptide [Homo
sapiens] 


7.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Caenorhabditis










466


AL021066


elegans cosmid 
H31B20, complete 
sequence 
[Caenorhabditis 
elegans]


0.52 


2589162 


{Doty* j I ) aldehyde oxidase [Zea
mays]


6.0 


467 


AF038588 


Porphyra linearis 18S 
ribosomal RNA gene,
3' partial sequence


0.52 


1055055 


(Uj98J0) coded for by C.
elegans cDNA yk37g1.5; coded
for by C. elegans cDNA
yKjC".j; coded for by C.
elegans cDNA yk1a9.5;
alternatively spliced form of 
F52C9.8b 


4.6 


468 


AE001125


Borrelia burgdorferi
(section 11 of 70) of
the complete genome


0.52 


4115827 


(AB021287) polyprotein 
[Hepatitis G virus] 


2.0 


469 


AF006640 


Drosophila 
melanogaster Ste20- 
like protein kinase 
mRNA, complete cds


0.52 


1109830


(U41534) coded for by C. 
elegans cDNA Cfcco i4zr.
Similar to helicases of 
SNF2/RAD54 family. 
[Caenorhabditis elegans] 


0.002 


470 


U90177 


Aplysia californica
ubiquitin carboxyl- 
terminal hydrolase 
(Ap-uch) mRNA, 
complete cds 


0.51 


<NONE> 


<NONE> 


<NONE> 


471 


Z28304 


S.cerevisiae 
chromosome XI 
reading frame ORF 
YKR079c 


0.51 


<NONE> 


<NONE> 


<NONE> 


472 


Z92837 


Caenorhabditis
elegans cosmid 
R03E1, complete
sequence
[Caenorhabditis
elegans]


0.51


123506 


HYDROPHOBIC SEED 
PROTEIN (HPS) 


7.6 


473 


D13803 


Mouse mRNA for 
RecA-like protein
MmRad51, complete 
cds 


0.51 


3327228 


(AB014607) KIAA0707 protein
[Homo sapiens] 


4.5 


474 


X07187 


Pea hsp21 mRNA


0.51 


3328678 


(AE001299) hypothetical 
protein [Chlamydia trachomatis] 


4.4 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






CCAAT/enhancer-










475 


S63168 


binding protein 
delta=transcription
factor CRP3 homolog
[human, prostate
carcinoma cell line
LNCaP, Genomic,
1594 nt]


0.51 


1653215 


(D90911) apolipoprotein N-
acyltransferase [Synechocystis
sp.]


1.2 


476 


U67078 


Xenopus laevis C2- 
HC type zinc finger 
protein X-MyTl 
mRNA, complete cds 


0.51 


3850320 


(AF067520) PITSLRE protein 
kinase beta SV2 isoform [Homo 
sapiens] 


0.17 


477 


L38933 


Homo sapiens GT198 
mRNA, complete 
ORF 


0.51 


3219965 


HYPOTHETICAL 100.6 KD 
TRP-ASP REPEATS 
CONTAINING PROTEIN 
C2C6.04C IN CHROMOSOME 
I 


0.059 


478 


AF001000 


Lycopersicon 
esculentum 
polygalacturonase 1 


0.50


<NONE> 


<NONE> 


<NONE> 


479 


Z28304 


S.cerevisiae 
chromosome XI 
reading frame ORF 
YKR079C 


0.50 


<NONE> 


<NONE> 


<NONE> 


480 


X97225 


Oncorhynchus keta 
IGF-II gene


0.50 


<NONE> 


<NONE> 


<NONE> 


481 


AJ001388 


Homo Sapiens, RP58 
cDNA for complete 
mRNA 


0.50 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Homo Sapiens. RP58 










481 


AJ001388 


cDNA for complete 
mRNA 


0.50 


<NONE> 


<NONE> 


<NONE> 


482 


M86626 


P.occultum 23S 
ribosomal RNA, 
partial cds. 


0.50 


<NONE> 


<NONE> 


<NONE> 


483 


U76523 


Sambucus nigra lectin
precursor mRNA, 
complete cds 


0.50 


1722856 


CHROMOSOME ASSEMBLY
PROTEIN XCAP-E African
clawed frog >gi|563814
(U13674) XCAP-E [Xenopus
laevis]


3.2 


484 


AF031663 


Mus musculus sen a tin 
mRNA, complete cds 


0.50 


179521 


(M63730) BPAG2 [Homo 
sapiens] 


3.2 


485 


U32729 


Haemophilus 
influenzae Rd section 
44 of 163 of the 
complete genome


0.50 


3875699 


(Z92829) F10A3.15
[Caenorhabditis elegans] 


0.65 


486 


AF067198 


Dictyostelium 
discoideum clone 
9.10Tdd-3 and RED 
repetitive elements, 
partial sequence 


0.50 


2494740 


HYPOTHETICAL 28.3 KD
PROTEIN IN GBD 5'REGION
(ORF4) >gi|2120954|pir||I39562
ORF4 - Alcaligenes eutrophus
>gi|695274 (L36817) ORF4


0.008 


487 


M23442 


Human interleukin 4 
(IL-4) gene, complete 
cds. 


0.49 


<NONE> 


<NONE> 


<NONE> 


488 


U16367


Caenorhabditis 
elegans POU 
homeobox protein 
CEH-18 (ceh-18) 
mRNA, complete cds. 


0.47 


3786409 


(AF098499) contains similarity
to Saccharomyces cerevisiae
MAF1 protein (GB:U19492)
[Caenorhabditis elegans]


8.9 


489 


AF001000 


Lycopersicon 
esculentum
polygalacturonase 1


0.45 


<NONE> 


<NONE> 


<NONE> 


490 


Z18920


Yersinia
enterocolitica wbb
gene cluster


0.41 


<NONE> 


<NONE> 


<NONE> 


491 


D86983 


Human mRNA for 
KIAA0230 gene, 
partial cds 


0.35 


206712 


(M64793) salivary proline-rich 
protein [Rattus norvegicus] 


4e-05 


492 


AF064030 


Helianthus tuberosus
lectin 2 mRNA, 
complete cds 


0.33 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 






P VALUE












Vitreoscilla sp. outer










493 


AF0670S3 


membrane protein 
homolog gene,
complete cds; Trp 
repressor binding 
protein gene, partial
cds; and unknown
genes 


0.33 


401553 


HYPOTHETICAL 24.5 KD 
PROTEIN IN NADB-SRMB 
INTERGENIC REGION 


8.3 


494 


Y15520 


Papio hamadryas 
anubis gene encoding 
fertilin alpha-II 


0.29 


2408049


(Z99164) hypothetical protein


3.1 


495 


U33475 


Alestes sp. 
ependymin mRNA, 
partial cds


0.28 


3913078 


ARYL HYDROCARBON
RECEPTOR NUCLEAR 
TRANSLOCATOR 
HOMOLOG (DARNT) 
(TANGO PROTEIN) 
transcription factor [Drosophila
melanogaster] 


1.4 


496 


D88356 


Mouse DNA for 8- 
oxodGTPase, 
complete cds 


0.22 


<NONE> 


<NONE> 


<NONE> 


497 


U67603 


Methanococcus 
jannaschii section 145 
of 150 of the 
complete genome 


0.22 


2209261 


(U51222) p40 [Streptomyces 
halstedii]


8.3 


498 


U82386 


Malurus cyaneus 
microsatellite McyU2 


0.22 


992631 


(U29131) Mg-chelatase subunit 
fSynechocystis sp.] 


0.56 


499 


Z49625 


S.cerevisiae
chromosome X 
reading frame ORF 
YJR125C 


0.21 


<NONE> 


<NONE> 


<NONE> 


500 


U64830 


Dictyostelium
discoideum AX2 
protein tyrosine 
kinase gene, complete 
cds. 


0.21 


<NONE> 


<NONE> 


<NONE> 


501 


M24543 


Human prostate- 
specific antigen (PA) 
gene, complete cds. 


0.21


2764859 


(X97918) gene 12.1
[Bacteriophage SPP1]


6.0 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












u0002b protein - 




502 


X87618 


B.taurus mRNA for 
thrombospondin 
(partial) 2162 bp 


0.21


2146000 


Mycobacterium tuberculosis 
tuberculosis]
>gi|1694863|gnl|PID|e283373
(Z83018) hypothetical protein 
Rv2968c [Mycobacterium 
tuberculosis] 


3.5 


503 


X71591 


B.taurus 
microsatellite 
sequence INRA048 


0.21 


1354453 


(U52830) orf [Homo sapiens] 


2.7 


504 


X57808 


Human germline 
immunoglobulin 
lambda light chain 
gene 


0.21 


2119158 


procollagen type V alpha 2 - 
mouse >gi|309181 


2.7


505 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds


0.21 


2497139 


HYPOTHETICAL / O KD
PROTEIN IN ABF2-CHU2
INTERGENIC REGION
>gi|1078003|pir||S52835
hypothetical protein YMR075w -
yeast (Saccharomyces
cerevisiae) >gi|763022
(Z48952) unknown
[Saccharomyces cerevisiae]


2.0 


506 


U84216 


Mycobacterium 
fortuitum plasmid 
pJAZ38 replication 
protein Rep (rep) 
gene, complete cds 


0.21 


2499087 


UDP-GLUCOSE:GLYCOPROTEIN
GLUCOSYLTRANSFERASE
PRECURSOR (DUGT) 
glucosyltransferase - fruit fly 
(Drosophila sp.) 
glucosyltransferase precursor 
[Drosophila melanogaster] 


0.003 


507 


U31463


Rattus norvegicus 
nonmuscle myosin 
heavy chain-A 
mRNA, complete cds. 


0.21


3880111 


(Z81130) predicted using
Genefinder 




508 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.21 


630864 


LRR47 protein - fruit fly
(Drosophila melanogaster)
>gi|415947 (X75760) LRR47
[Drosophila melanogaster]


1e-06


509 


AF086476 


Homo sapiens full 
length insert cDNA
clone ZD88F12 


0.20 


<NONE> 


<NONE> 


<NONE> 


510 


AF077006 


Helicobacter pylori 
plasmid pHPM186,
complete sequence 


0.20


<NONE>


<NONE> 


<NONE> 


511 


X75480 


E.gunnii CAD gene.


0.20 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






T.aestivum 












512


X75036


mitochondrial nad7
gene for NADH 
dehydrogenase 
subunit 7 


0.20 


<NONE> 


<NONE> 


<NONE> 


513 


D90875 


E.coli genomic DNA,
Kohara clone 
#422(55.5-55.8 min.) 


0.20 


<NONE> 


<NONE> 


<NONE> 


514 


Z68343 


Caenorhabditis
elegans cosmid
F59B8, complete
sequence 
[Caenorhabditis 
elegans] 


0.20 


<NONE> 


<NONE> 


<NONE> 


515 


X62486 


M.musculus V alpha 
11.1 gene 5'-region 


0.20 


<NONE> 


<NONE> 


<NONE> 


516 


AF040651 


Caenorhabditis 
elegans cosmid 
W04H10 


0.20 


1170683 


PHOSPHORYLASE B
KINASE ALPHA
REGULATORY CHAIN,
SKELETAL MUSCLE
ISOFORM
(PHOSPHORYLASE KINASE
ALPHA M SUBUNIT)
>gi|2135923|pir||I38Ui
phosphorylase kinase (EC 
2.7.1.38) - human >gi|791043 


7.4 


517 


U10470


Pseudomonas 
fluorescens PHA 
depolymerase (phaZ) 
gene, complete cds. 


0.20 


3721862 


(AB016024) Pfj2 [Plasmodium
falciparum] 


1.9


518 


D83778 


Human mRNA for 
KIAA0194 gene,
partial cds 


0.20 


126363


LAMININ ALPHA- 1 CHAIN 
PRECURSOR precursor - 
human 


0.65 


519 


S43579 


c-src=pp60c-src,
sdr=src downstream 
region 


0.20 


4159887 


(AC004908) similar to
ribosomal protein L23a; similar
to P29316 (PID:g132848)
[Homo sapiens] 


0.52 


520 


U07357 


Mus musculus Balb/c
brain-specific kinase
(Bsk) mRNA,
complete cds. 


0.20 


206712 


(M64793) salivary proline-rich 
protein [Rattus norvegicus]


0.51 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















521 


AF034460 


Penicillium thomii
internal transcribed
spacer 1, 5.8S
ribosomal RNA gene
and internal
transcribed spacer 2,
complete sequence;
and 28S ribosomal
RNA gene, partial
sequence 


0.20 


114136 


AMINO-ACID 
ACETYLTRANSFERASE 
Pseudomonas aeruginosa
>gi|151036 (M38358) N-
acetylglutamate synthase 
[Pseudomonas aeruginosa] 


0.39 


522 




Xenopus laevis 
mitotic 

phosphoprotein 44
mRNA, partial cds






POU DOMAIN CLASS 2
ASSOCIATING FACTOR 1 (B-
CELL-SPECIFIC
COACTIVATOR OBF-1) (OCT
BINDING FACTOR 1) (BOB-
1) (OCA-B) Bob1, B-cell-
specific - mouse
>gi|1881S1S|bbs|179852
nriRnh I — B-cell specific
transcriptional coactivator line
J558L, Peptide, 256 aa]
>gi|1353792 (U43788) Oct
binding factor 1 [Mus musculus]




523 


X95971 


S.lividans groEL2 
gene 


0.20 


3925277 


(AL0J-64i) similar to
Uncharacterized protein family
UPF0034. Double-stranded
RNA binding motif; cDNA EST
yk[illegible] comes from this
gene; cDNA EST yk439g7.5
comes from this gene
[Caenorhabditis elegans]


4e-19 


524 


L41502 


Ovis aries
vasopressin V1
receptor (V1R) gene,
complete cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


525 


J03885 


K.pneumoniae 
oxalacetate 
decarboxylase alpha 
subunit gene, 
complete cds. 


0.19 


<NONE> 


<NONE> 


<NONE> 


526 


AE001451


Helicobacter pylori, 
strain J99 section 12 
of 132 of the 
complete genome 


0.19 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 















527 


D88084 


Pedicularis 
verticillata
chloroplast DNA,
intergenic region
between trnT(UGU)
and trnL(UAA) 5'exon


0.19 


<NONE> 


<NONE> 


<NONE> 


528 


U67599 


Methanococcus 
jannaschii section 14 
of 150 of the 
complete genome 


0.19 


<NONE> 


<NONE> 


<NONE> 


529 


J05500


Human beta-spectrin 
(SPTB) mRNA, 
complete cds. 


0.19 


<NONE> 


<NONE> 


<NONE> 


530 


Y10137 


M.mycoides ftsY 
gene homologue and 
gene encoding 
hypothetical protein 


0.19 


<NONE> 


<NONE> 


<NONE> 


531 


AF027174 


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete 
cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


532 


D43805 


Mouse thymic 
stromal cell mRNA 
for TLSF-beta, 
complete cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


533 


AJ012585 


Tetrahymena 
thermophila 
macronuclear gene 
encoding ribosomal 
protein L3, exons 1-2


0.19 


<NONE> 


<NONE> 


<NONE> 


534 


X51475 


Brassica napus 5-
enolpyruvylshikimate-
3-phosphate synthase
gene


0.19


<NONE> 


<NONE> 


<NONE> 


535 


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds


0.19 


<NONE> 


<NONE> 


<NONE> 


536 


Z49625


S.cerevisiae 
chromosome X 
reading frame ORF 
YJR125c 


0.19 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






H.sapiens pilot 










537 


X63741 


mRNA 


0.19 


<NONE> 


<NONE> 


<NONE> 


538 


Y11255


O.latipes mRNA for
annexin max4 


0.19 


<NONE> 


<NONE> 


<NONE> 


539 


T 63537 


Oncorhynchus mykiss
(clone Jb-10) beta-2
microglobulin (B2m)
mRNA, complete cds.


0.19 


<NONE> 


<NONE> 


<NONE> 


540 


X70903 


N.tabacum T92 gene
for auxin-binding 
protein 


0.19 


<NONE> 


<NONE> 


<NONE> 


541 


U61958 


Caenorhabditis 
C25A8 


0.19 


<NONE> 


<NONE> 


<NONE> 


542 


U JJ7J7 


Macaca fascicularis 
fertilin beta mRNA,


0.19


<NONE> 


<NONE> 


<NONE> 


543 




H.sapiens mRNA for 
protein disulfide 
isomerase


0.19


n 1 13940 


(Z95556) hypothetical protein 
Rv2507 


9.4 


544 


AF035458 


Spinacia oleracea 
protein, complete cds 


0.19 


267293 


PROBABLE E4 PROTEIN 
papillomavirus (type I)
>gi|61015 (X62844) E4 gene
product [Pygmy chimpanzee 
papillomavirus type 1] 


9.4 


545 


U23441 


Tetrahymena 
thermophila B 

internal deletion

sequence. 


0.19 


3877185 


(Z66563) F46C3.2 
[Caenorhabditis elegans] 


9.3


546 


U53921 


Pneumocystis carinii 
major surface 
glycoprotein 


0.19 


3548901 


(AF052502) DA26 homolog 
[Epiphyas postvittana
nucleopolyhedrovirus]


9.3 


547 


L11002 


Rat ankyrin binding 
glycoprotein-1 related
mRNA sequence. 


0.19 


3337352 


(AC004481) putative chromatin 
structural protein Supt5hp 


9.1 


548 


U67560 


Methanococcus 
jannaschii section 102 
of 150 of the 
complete genome 


0.19 


3183689 


(Y13585) serotonin receptor 4
[Cavia porcellus] 


8.7 


549 


U18424


Mus musculus 
bacteria binding 
macrophage receptor 
MARCO mRNA,
complete cds. 


0.19 


3659853 


(AF0S9083) complement 
component ClqB like protein 


7.1 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(U5S751) C07G1.7 gene




550 


X66467 


C.albicans sec18 gene


0.19 


1326385 


product [Caenorhabditis 
elegans]


6.9 


551 


AF003487 


Syngaster lepidus 16S
ribosomal RNA gene,
partial sequence


0.19 


3122039 


DIHYDROPYRIMIDINASE
(DHPASE) dihydropyrimidinase
- rat
>gi|1378019|gnl|PID|d1010479


6.9 


552 


J05087 


Rat calmodulin- 
sensitive plasma 
membrane Ca2+- 
transporting ATPase 
(PMCA3) mRNA, 
complete cds. 


0.19 


422462 


hypothetical protein - fruit fly 
(Drosophila melanogaster) 
>gi|296434 (X68408) ORF 
[Drosophila melanogaster] 


5.3 


553 


AF080464 


Homo sapiens 
glutamate 
oxaloacetate 
transaminase 


0.19 


3024834 


PROBABLE E4 PROTEIN 
>gi|790898 position 3286..328S 
is first start codon; putative 


5.3 


554 


U78876 


Human MEK kinase 
3 mRNA, complete 
cds 


0.19 


1710445 


(U78083) unknown [Emericella 
nidulans]


5.3


555 


AB009077


Vigna radiata mRNA 
for proton 
pyrophosphatase, 
complete cds 


0.19 


3256922 


(AP000002) 256aa long 
hypothetical protein 
[Pyrococcus horikoshii]


5.1 


556 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds


0.19 


4226159 


(AF125463) contains similarity
to BTB (also known as BR-
C/Ttk) domains (Pfam:PF00651,
Score=62.8, E=7.6e-15, N=1)
[Caenorhabditis elegans]


4.1 


557 


AE000392 


Escherichia coli K-12 
MG1655 section 282 
of 400 of the 
complete genome


0.19 


3645960 


(AL031583) 1-
evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=47.46; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SWISS-
PROT:P23792; 2-
match_description=DISCONNE
CTED PROTEIN.; 2-matc...


4.0 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















558 


AE000392 


Escherichia coli K-12
MG1655 section 282 
of 400 of the 
complete genome 


0.19 


3645960 


(AL031583) 1-
evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=47.46; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SWISS-
PROT:P23792; 2-
match_description=DISCONNE
CTED PROTEIN.; 2-matc...


4.0 


559 


L81774 


Homo sapiens 
(subclone J_ ai from
P1 H25) DNA
sequence 


0.19 


4001725 


(AB015981) MnhA
[Staphylococcus aureus] 


3.0 


560 


AL021108 


Drosophila 
melanogaster cosmid 
clone 137E7 


0.19 


4001688 


(AB015718) protein kinase
[Homo sapiens]


3.0 


561 


AB001510


Carabus
leptoplesioides 
mitochondrial DNA 
for NADH 
dehydrogenase 
subunit 5, partial cds 






(Z98551) MAL3P6.11
[Plasmodium falciparum]


2.4 


562 


AF069696 


Egernia stokesii clone
EST1 microsatellite 


0.19 


3328994 


(AE001326) Amino Acid
(Branched) Transport
[Chlamydia trachomatis]


2.4 


563 


X64144 


F.pringlei ppcA1
gene for 

phosphoenolpyruvate 
carboxylase 


0.19 


3242974 


(AF069555) G protein-coupled 
receptor p2y3 [Meleagris 
gallopavo] 


2.3 


564 


U56897 


Human 
immunodeficiency 
virus type 1 gag 
polyprotein (gag) 
gene, partial cds 


0.19 


2257710 


(U73041) resolvase-like protein 
[Thiobacillus ferrooxidans] 


2.3 


565 


U57975 


Danio rerio Notch 
homologue 3 mRNA, 
complete cds 


0.19 


3874971 


(IW similar to NAD 
dependent 
epimerase/dehydratase family; 
cDNA EST EMBL:C10103 
comes from this gene; cDNA 
EST EMBL:D66400 comes 
from this gene; cDNA EST 
EMBL:D70143 comes from this 
gene; cDNA EST yk493h11.3 
comes from ... 


1.8 



WO 01/02568 



PCT/US00/18374 


566 


Y12502 


R.norvegicus mRNA 
for factor XIIIa 


0.19 


2133693 


masquerade precursor - fruit fly 
(Drosophila melanogaster) 
>gi|665545 (U18130) 
masquerade [Drosophila 
melanogaster] 
>gi|1095942|prf||2110286A 
masquerade gene 


1.8 


567 


S82470 


BB1-malignant cell 
expression-enhanced 
gene/tumor 
progression-enhanced 
gene [human, UM- 
UC-9 bladder 
carcinoma cell line 
mRNA, 1897 nt] 


0.19 


2444026 


(U77783) N-methyl-D-aspartate 
receptor subunit precursor 
[Homo sapiens] 


1.8 


568 


U97408 


Caenorhabditis 
elegans cosmid 
F48A9 


0.19 


542433 


225K protein - Babesia bovis 
(fragment) 


1.8 


569 


U10470 


Pseudomonas 
fluorescens PHA 
depolymerase (phaZ) 
gene, complete cds. 


0.19 


3721862 


(AB016024) Pfj2 [Plasmodium 
falciparum] 


1.7 


570 


M88160 


Ovis aries MAF214 
locus polymorphic 
dinucleotide repeat. 


0.19 


1293816 


(U56963) T13A10.5 gene 
product [Caenorhabditis 
elegans] 


1.4 


571 


AJ131336 


Holcus lanatus 
mRNA for pollen 
allergen (Hol l 2, 
group II) > 
emb|AJ131339|LIT131339 
Lolium italicum 
mRNA for pollen 
allergen (Lol i 2, 
group II) > allergen 
(Poa p 2, group II) > 
emb|AJ131338|TAE131338 
Triticum 
aestivum mRNA for 
pollen allergen (Tri a 
2, group II) 


0.19 


3880447 


(AL032675) predicted using 
Genefinder 


0.82 


572 


X84036 


S.cerevisiae ARG8 
and CDC33 genes 


0.19 


3882041 


(AJ010405) hypothetical protein 


0.62 
573 


U57058 


Human WD protein 
DUO pre-mRNA, 
partial cds 


0.19 


631302 


mucin - human >gi|501033 
(U14383) mucin [Homo 
sapiens] 


0.60 


574 


AF034460 


Penicillium thomii 
internal transcribed 
spacer 1, 5.8S 
ribosomal RNA gene 
and internal 
transcribed spacer 2, 
complete sequence; 
and 28S ribosomal 
RNA gene, partial 
sequence 


0.19 


114136 


AMINO-ACID 
ACETYLTRANSFERASE 
Pseudomonas aeruginosa 
>gi|151036 (M38358) N- 
acetylglutamate synthase 
[Pseudomonas aeruginosa] 


0.35 


575 


U95098 


Xenopus laevis 
mitotic 
phosphoprotein 44 
mRNA, partial cds 


0.19 


105270 


alpha-2-adrenergic receptor - 
human name 'ADRA2R' [Homo 
sapiens] 


0.27 


576 


AG001475 


Homo sapiens 
genomic DNA, 21q 
region, clone: 
125H6N2 


0.19 


94977 


hypothetical protein 3 - 
Pseudomonas sp. (DSM 6898) 
plasmid pKB740 >gi|45867 
(X66604) ORF3 


0.16 


577 


M63284 


Mouse IgG receptor 
(beta-Fc-gamma-RII) 
gene, exons 9 and 10, 
clones lambda* 
Fc(3.2,93). 


0.19 


3024681 


TRANSCRIPTION 
INITIATION FACTOR TFIID 
135 KD SUBUNIT (TAFII-135) 
(TAFII135) (TAFII-130) of 
RNA polymerase II transcription 
factor TFIID [Homo sapiens] 


0.088 


578 


U38241 


Pseudomonas 
aeruginosa orotate 
phosphoribosyl 
transferase (pyrE), 
catabolite repression 
control protein (crc) 
and RNasePH (rph) 
genes, complete cds 


0.19 


3044086 


(AF055904) unknown 
[Myxococcus xanthus] 


0.052 


579 


AF039734 


Lontra longicaudis 
transthyretin intron I, 
partial sequence 


0.19 


322759 


pistil extensin-like protein 
(clone pMG14) - common 
tobacco (fragment) >gi|19927 
(Z14015) pistil extensin like 
protein [Nicotiana tabacum] 


0.030 


580 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.19 


2147194 


collagen - Paralvinella grasslei 


0.002 
581 


AB004232 


Drosophila 
melanogaster mRNA 
for DAD polypeptide, 
complete cds 


0.19 


2498765 


PEROXISOMAL MEMBRANE 
PROTEIN PEX16 [Yarrowia lipolytica] 


0.002 


582 


AF098919 


Gallus gallus alpha- 
globin gene domain 5' 
region 


0.19 


1086863 


(U41272) T03G11.6 gene 
product [Caenorhabditis 
elegans] 


4e-05 


583 


AE001457 


Helicobacter pylori, 
strain J99 section 18 
of 132 of the 
complete genome 


0.19 


2924552 


(AL022018) 1- 
evidence=predicted by content; 
1-method=genefinder;084; 1- 
method_score=165.48; 1- 
evidence_end; 2- 
evidence=predicted by match; 2- 
match_accession=AA264666; 2- 
match_description=LD08351.5p 
rime LD Drosophila melanoga... 


3e-05 


584 


L10329 


Plasmid RP4 traE 
gene, 3' end; traD 
gene, complete cds; 
trap gene, 5' end. 


0.19 


3878117 


(Z49068) mitochondrial carrier 
protein 


8e-07 


585 


AE001155 


Borrelia burgdorferi 
(section 41 of 70) of 
the complete genome 


0.19 


861276 


(U28739) similar to TPR 
domains in e.g. yeast STI1 
protein [Caenorhabditis elegans] 


2e-12 


586 


U49979 


Orf virus E10R 
homolog gene, partial 
cds, and DNA 
polymerase gene, 
complete cds 


0.19 


3850072 


(AL033385) dna-directed rna 
polymerase iii subunit 
[Schizosaccharomyces pombe] 


1e-15 


587 


U88155 


Xenopus laevis 
RanGTPase 
activating protein 


0.19 


995714 


(X9U5S) pid:e!98503 
[Saccharomyces cerevisiae] 


4e-16 


588 


AF061854 


Schizosaccharomyces 
pombe Clr4p (clr4) 
gene, complete cds 


0.19 


3242750 


(AC005164) match to ESTs 
AA731149 (NID:g2140138), 
AA731908 (NID:g2752719), 
AA287837 (NID:g1933519), 
AA262811 (NID:g1898382), 
and AA825820 (NID:g2899132) 


5e-19 


589 


M23865 


S.cerevisiae CHS2 
gene encoding chitin 
synthase. 


0.18 


<NONE> 


<NONE> 


<NONE> 
590 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.18 




<NONE> 


<NONE> 


591 


AF067610 


Caenorhabditis 
elegans cosmid 
F41A4 


0.18 


<NONE> 


<NONE> 


<NONE> 


592 


AF036329 


Homo sapiens 
gonadotropin- 
releasing hormone 
precursor, second 
form (GnRH-II) gene, 
complete cds 


0.18 


<NONE> 


<NONE> 


<NONE> 


593 


Z49216 


H.sapiens 
mitoxantrone- 
resistance associated 
mRNA 


0.18 


<NONE> 


<NONE> 


<NONE> 


594 


X02167 


Torulopsis glabrata 
mitochondrial DNA 
for tRNA-Thr, -His 
and -Glu upstream of 
cytochrome b gene 


0.18 


<NONE> 


<NONE> 


<NONE> 


595 


Z31561 


R.communis 
(Carmencita) Scr1 
mRNA for sucrose 
carrier 


0.18 






<NONE> 


596 


L81692 


Homo sapiens 
(subclone 2_c9 from 
P1 H56) DNA 


0.18 


1346575 


55 KD ERYTHROCYTE 
MEMBRANE PROTEIN 


8.4 


597 


X57310 


Nocardia 

lactamdurans pcbAB 
and pcbC genes for 
alpha-aminoadipyl-L- 
cysteinyl-D-valine 
synthetase and 
isopenicillin N 
synthase 


0.18 


126404 


SEED LIPOXYGENASES (L- 
2) soybean >gi|170014 (J03211) 
lipoxygenase (EC 1.13.11.12) 




598 


U18315 


Sus scrofa 

parathyroid receptor 
(PTH) mRNA, 
complete cds 


0.18 


1022323 


(X04647) collagen alpha-2(IV) 
chain [Mus musculus] 


3.8 
599 


AL010158 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 3-85, 
complete sequence 


0.18 


2506816 


VERSICAN CORE PROTEIN 
PRECURSOR 
PROTEOGLYCAN CORE 
PROTEIN 2) (GLIAL 
HYALURONATE-BINDING 
PROTEIN) (GHAP) >gi|608515 
(U16306) chondroitin sulfate 
proteoglycan versican V0 splice- 
variant precursor peptide 


3.7 


600 


AB005287 


Bos taurus mRNA for 
thrombospondin 1, 
complete cds 


0.18 


2146000 


u0002b protein - 
Mycobacterium tuberculosis 
tuberculosis] 

>gi|1694863|gnl|PID|e283373 
(Z83018) hypothetical protein 
Rv2968c [Mycobacterium 
tuberculosis] 


2.9 


601 


AL021108 


Drosophila 
melanogaster cosmid 
clone 137E7 


0.18 


3483032 


(AL031371) hypothetical 
protein SC4G2.06 
[Streptomyces coelicolor] 


2.9 


602 


U57975 


Danio rerio Notch 
homologue 3 mRNA, 
complete cds 


0.18 


85719 


collagen alpha 1'(II) chain 
precursor - African clawed frog 


1.7 


603 


M30124 


P.aeruginosa 
autonomously 
replicating sequence. 


0.18 


3878017 


(AL021387) similar to Zinc 
finger, C4 type (two domains); 
cDNA EST yk452f4.5 comes 
from this gene; cDNA EST 
EMBL:T00774 comes from this 
gene receptor NHR-3 
[Caenorhabditis elegans] 


1.3 


604 


X54965 


G.sp alpha 5HR DNA 


0.18 


134304 


STEM CELL PROTEIN 
chicken >gi|62845 (X63371) 
transforming capacity [Gallus 
gallus] 


1.3 


605 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 


0.13 


1628403 


(X98893) hTAFII68 [Homo 
sapiens] splicing [Homo 
sapiens] 


1.3 


606 


U20793 


Oryctolagus 
cuniculus renal 
sodium-dependent 
phosphate transporter 
type II mRNA, 
complete cds. 


0.18 


1705984 


92 KD TYPE IV 
COLLAGENASE 
PRECURSOR IV, 92K. 
precursor - rat >gi|1022784 
(U36476) 92-kDa type IV 
collagenase [Rattus norvegicus] 


1.2 
607 


IP 3427 


Human 

cholecystokinin type 
A receptor (CCK-A) 
gene, exons 1 and 


0.18 


3261734 


(Z94752) hypothetical protein 

P v IHOdr 
J\v i Uv^v 


0.97 


608 


U49953 


Rattus norvegicus 

rttrit^in Winamp VfTJlC^ 

mRNA, complete cds 


0.18 


551238 


\ Aw 1 O" ' / 1/Ct U1LC 1 Yuov I 

[Erwinia carotovora] 


0.43 


609 


J00182 


Human alpha globin 
gene cluster on 

chromosome 16: zeta 
gene. 


0.18 


1585259 


U <JJ gene ^/'VlliyLU'la LU^Ja 

methanolica] 


0.41 


610 


X62513 


M.gallopavo gene for 
metallothionein 


0.18 


2494740 


HYPOTHETICAL 28.3 KD 
PROTEIN IN GBD 5' REGION 

(ORF4^ ■xnfr 1 2 09 54 Ini HI 139 ^62 

ORF4 - Alcaligenes eutrophus 
>gi|695274 (L36817) ORF4 


0.31 


611 


X04862 


Goat embryonic alpha 
globin gene zeta 
exons 2-3 


0.18 


86837 


androgen receptor B - human 


0.082 


612 


M12450 


Rat vitamin D 
binding protein 
(DBP) mRNA, 
complete cds. 


0.18 


4210432 


(AJ130783) APC2 protein [Mus 
musculus] 


0.038 


613 


AF038539 


\/Tmc mi i cr*i i li ic mncf 1^ 

NSP-like 1 (NspII) 
mRNA, complete cds 


0.18 


3297877 


(AJ224868) GNAS1 [Homo 
sapiens] 


0.029 


614 


X78401 


Bacteriophage P22 
right operon, orf 48, 
replication genes 18 
and 12, nin region 
genes, ninG 
phosphatase, late 
control gene 23, orf 
60, complete cds, late 
control region, start 
of lysis gene 13 


0.18 


1123087 


(U42436) C49H3.3 gene 
product [Caenorhabditis 
elegans] 


0.009 


615 


D38754 


Pig mRNA for inter- 
alpha-trypsin 
inhibitor heavy-chain 
H1, complete cds 


0.18 


1397275 


(U61947) C06G3.S gene 
product [Caenorhabditis 
elegans] 


7e-06 
616 


X51508 


Rabbit mRNA for 
aminopeptidase N 
(partial) 


0.18 


630864 


LRR47 protein - fruit fly 
(Drosophila melanogaster) 
>gi|415947 (X75760) LRR47 
[Drosophila melanogaster] 


6e-07 


617 


X54850 


S.kluyveri linear 
plasmid pSKL DNA 
for open reading 
frames 1-10 


0.18 


3183405 


HYPOTHETICAL FlJ KD 
PROTEIN C2C6.07 IN 
CHROMOSOME I 
>gi|2370504|gnl|PID|e339194 
pombe] 
>gi|3451305|gnl|PID|e1316730 
(AL031324) very hypothetical 
protein [Schizosaccharomyces 
pombe] 


2e-08 


618 


L21954 


Human peripheral 
benzodiazepine 
receptor gene, exon 4. 


0.18 


3925211 


(ALUiibibjcUNAEM 

EMBL:D70654 comes from this 
gene; cDNA EST 
EMBL:Z14359 comes from this 
gene; cDNA EST 
EMBL:D33409 comes from this 
gene; cDNA EST 
EMBL:D36239 comes from this 
gene; cDNA EST 
EMBL:Z14766 comes from this 
gene... 


4e-09 


619 


U09355 


Oryctolagus 
cuniculus protein 
phosphatase 2A1 B 
gamma subunit 
(skeletal muscle 
isolate) mRNA, 
complete cds. 


0.18 


3947877 


(AL034382) putative mitosis 
and maintenance of ploidy 
protein [Schizosaccharomyces 
pombe] 


8e-11 


620 


X58715 


T.cruzi hsp70 mRNA 
for 70 kDa heat shock 
protein, partial cds 


0.18 


3024081 


MYOSIN LIGHT CHAIN 
KINASE, SMOOTH MUSCLE 
AND NON-MUSCLE 
ISOZYMES (MLCK) 
(CONTAINS: TELOKIN) 


9e-12 


621 


AF060195 


Mus musculus 
proteasome regulator 
PA28 beta subunit 
gene, complete cds 


0.18 


861276 


(U28739) similar to TPR 
domains in e.g. yeast STI1 
protein [Caenorhabditis elegans] 


1e-14 


622 


L27235 


Methylobacterium 
extorquens serine 
cycle proteins 


0.18 


2688949 


(AF027208) AC133 antigen 
[Homo sapiens] 


1e-14 
623 


AF006573 


Drosophila virilis 
maltase 1 (Mav1) and 
maltase 2 (Mav2) 
genes, complete cds 


0.18 


2500558 


PUTATIVE RIBONUCLEASE 
III (RNASE III) 
>gi|3876420|gnl|PID|e1346063 
(Z81070) similar to ribonuclease 
[Caenorhabditis elegans] 


2e-23 


624 


AF001782 


Staphylococcus 
aureus strain SA502A 
AgrB 


0.17 


<NONE> 


<NONE> 


<NONE> 


625 


AJ223364 


Homo sapiens germ- 
line DNA upstream of 
Jkappa locus 


0.17 


<NONE> 


<NONE> 


<NONE> 


626 


J03059 


Human 

glucocerebrosidase 
(GCB) gene, 
complete cds 


0.17 


<NONE> 


<NONE> 


<NONE> 


627 


AB008860 


Fugu rubripes Ca12 
gene for pheromone 
receptor, complete 
cds 


0.17 


2198849 


(AF004900) E3KARP [Homo 
sapiens] >gi|2665826 
(AF035771) Na+/H+ exchanger 
regulatory factor 2 [Homo 
sapiens] factor 2 [Homo 
sapiens] 
>gi|3618353|gnl|PID|d1034182 
exchanger isoform A3 [Homo 
sapiens] 


7.8 


628 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


0.17 


539355 


SCD25 protein (version 1) - 

yeast 


7.5 


629 


AF059650 


Homo sapiens histone 
deacetylase 3 
(HDAC3) gene, 
complete cds 


0.17 


482118 


hypothetical protein C15H7.1 - 
Caenorhabditis elegans 


4.5 
630 


AF059650 


Homo sapiens histone 
deacetylase 3 
(HDAC3) gene, 
complete cds 


0.17 


465932 


HYPOTHETICAL 012 KD 
PROTEIN F58A4.11 IN 
CHROMOSOME III 
>gi|3874287|gnl|PID|e1344088 
EST EMBL:C12577 comes 
from this gene; cDNA EST 
yk227e7.5 comes from this 
gene; cDNA EST yk303d1.5 
comes from this gene; cDNA 
EST yk314c12.5 comes from 
this gene; cDNA ... 
EMBL:C11886 comes from this 
gene; cDNA EST 
EMBL:C12577 comes from this 
gene; cDNA EST yk227e7.5 
comes from this gene; cDNA 
EST yk303d1.5 comes from this 
gene; cDNA EST yk314c12.5 
comes from this gene; cDNA ... 


4.4 


631 


X55065 


Chinese hamster 
metallothionein II 
gene 


0.17 


3657237 


(AC005169) putative Cys3His 
zinc-finger protein 


1.5 


632 


1715280 


Rattus norvegicus 
oxytocin receptor 
(OTR) gene, exon 3 
and complete cds 


0.17 


542565 


cyclin E type II - fruit fly 
(Drosophila melanogaster) 
>gi|429168 (X75027) 
Drosophila cyclin E type II 
[Drosophila melanogaster] 


0.45 


633 


X04862 


Goat embryonic alpha 
globin gene zeta 
exons 2-3 


0.17 


86837 


androgen receptor B - human 


0.080 


634 


AL010222 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 4-09, 
complete sequence 


0.17 


1177322 


(X95466) CPG2 protein [Rattus 
norvegicus] 
>gi|1588593|prf||2208498A 
plasticity-related gene [Rattus 
norvegicus] 


7e-07 


635 


X60111 


H.sapiens mRNA for 
MRP-1 


0.17 


3237306 


(U92715) breast cancer 
antiestrogen resistance 3 protein 


3e-09 


636 


U49979 


Orf virus E10R 
homolog gene, partial 
cds, and DNA 
polymerase gene, 
complete cds 


0.17 


3850072 


(AL033385) dna-directed rna 
polymerase iii subunit 
[Schizosaccharomyces pombe] 


7e-15 
637 


AF006573 


Drosophila virilis 
maltase 1 (Mav1) and 
maltase 2 (Mav2) 
genes, complete cds 


0.17 


2500558 


PUTATIVE RIBONUCLEASE 
III (RNASE III) 
>gi|3876420|gnl|PID|e1346063 
(Z81070) similar to ribonuclease 
[Caenorhabditis elegans] 


2e-29 


638 


AE001141 


Borrelia burgdorferi 
(section 27 of 70) of 
the complete genome 


0.16 


1850327 


(U52370) fertilin beta [Homo 
sapiens] 


2.3 


639 


M72980 


Anthonomus grandis 
vitellogenin gene 
(VTG), complete cds. 


0.12 


3242750 


(AC005164) match to ESTs 
AA731149 (NID:g2140138), 
AA731908 (NID:g2752719), 
AA287837 (NID:g1933519), 
AA262811 (NID:g1898382), 
and AA825820 (NID:g2899132) 


2e-56 


640 


AF023532 


Simulium vittatum 
ATPase 6 gene, 
mitochondrial gene 
encoding 
mitochondrial 
protein, partial cds 


0.11 


<NONE> 


<NONE> 


<NONE> 


641 


U76523 


Sambucus nigra lectin 
precursor mRNA, 
complete cds 


0.10 


3482965 


(AL031369) putative protein 


0.49 


642 


AJ001596 


Danio rerio mRNA 
for opioid receptor 
homologue 


0.099 


1706694 


LANOSTEROL SYNTHASE 
(EC 5.4.99.7) - fission yeast 
(Schizosaccharomyces pombe) 


2.3 


643 


U26341 


Oryctolagus 
cuniculus Na and Cl 
dependent betaine 
transporter mRNA, 
complete cds. 


0.099 


2645804 


(AF033381) betaine 
homocysteine methyltransferase 
[Mus musculus] 


0.59 


644 


M11633 


Bacteriophage Cp-5 
(S. pneumoniae) 3' 
inverted terminal 
repeat. 


0.082 


2314695 


(AE000649) type IIS restriction 
enzyme R and M protein 


4.3 


645 


X74103 


Streptomyces sp. 
gene for alkaline 
serine protease I 


0.073 


1314734 


(U54641) 220 kDa silk protein 
[Chironomus thummi] 


6.3 
646 


Z72509 


Caenorhabditis 
elegans cosmid 
F32G8, complete 
sequence 
[Caenorhabditis 
elegans] 


0.072 


<NONE> 


<NONE> 


<NONE> 


647 


X70282 


X.laevis xanf-1 gene 


0.070 


3851202 


(AC005954) ZO-3 [Homo 
sapiens] [Homo sapiens] 


0.40 




Z69906 


Human DNA 
sequence from 
cosmid E141E2, on 
chromosome 22, 
complete sequence 
[Homo sapiens] 


0.069 


<NONE> 


<NONE> 


<NONE> 




AF056940 


Drosophila virilis 
retrotransposon Tvl, 
complete sequence 


0.069 


2246532 


(U93872) ORF 73, contains 
large complex repeat CR 73 


5e-12 


650 


AJ001151 


Homo sapiens 
genomic sequence 


0.068 


<NONE> 


<NONE> 


<NONE> 


651 


X54455 


Bacteriophage BF23 
gene 17 and gene 18 


0.067 


<NONE> 


<NONE> 


<NONE> 


652 


X87936 


P.pinea internal 
transcribed spacers 1 
& 2 of ribosomal 
DNA 


0.067 


2459733 


(U95374) aldehyde 
dehydrogenase [Haloferax 
volcanii] 


4.3 




AF019236 


Dictyostelium 
discoideum TipD 
(tipD) gene, complete 
cds 


0.067 


3882275 


(AB018320) KIAA0777 protein 
[Homo sapiens] 


1.1 


654 


X90592 


O.cuniculus mRNA 
for p53 protein 


0.067 


1703275 


METHIONINE 
AMINOPEPTIDASE 2 
(METAP 2) GLYCOPROTEIN) 
(P67) 


0.29 


655 


U41305 


Mus musculus 
putative T1/ST2 
receptor binding 
protein precursor 
mRNA, partial cds 


0.067 


642518 


(U17326) neuronal nitric oxide 
synthase [Homo sapiens] 


0.29 


656 


AB007881 


Homo sapiens 
KIAA0421 mRNA, 
partial cds 


0.066 


<NONE> 


<NONE> 


<NONE> 


657 


AL010213 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 3-109, 
complete sequence 


0.066 


<NONE> 


<NONE> 


<NONE> 
658 


AB014546 


Homo sapiens mRNA 
for KIAA0646 
protein, complete cds 


0.066 


1082461 


homeotic protein HB9 - human 


0.38 


659 


AF104156 


Rattus exulans isolate 
nuanineju 
mitochondrial D- 
loop, partial sequence 


0.066 


1002380 


binding protein [Caenorhabditis 
elegans] 


0.29 


660 


X97581 


M.musculus mRNA 
for spalt transcription 
factor 


0.066 


4107313 


(AL035075) putative myosin 
heavy chain 


0.28 


661 


D85378 


Human clone ri-u 
N-acetylglucosaminyltransferase 
III DNA, 
exon 2 


0.066 


2114473 


(U96963) p140mDia [Mus 
musculus] 


0.22 


662 


M97561 


Human (clone 

LA 179) chromosome 

21 sequence. 


0.065 


<NONE> 


<NONE> 


<NONE> 


663 


AE001373 


Plasmodium 
falciparum 
chromosome 2, 
section 10 of 73 of 
the complete 
sequence 




<NONE> 


<NONE> 




664 


S75479 


growth hormone 
receptor, growth 
hormone binding 
protein {GHR/BP 
gene} [mice, C57 
Black/6, Genomic, 
179 nt, segment 8 of 
10] 


0.065 


<NONE> 


<NONE> 


<NONE> 


665 


AF032922 


Homo sapiens 
syntaxin 4 binding 
protein UNC-18c 
(UNC-18c) mRNA, 
complete cds 


0.065 


3061308 


(AB006074) topoisomerase III 
[Mus musculus] 


0.82 


666 


SS09S6 


svp[40]=svp-related 
nuclear 
receptor/retinoid 
signaling modulator 
[zebrafishes, mRNA, 
3576 nt] 


0.065 


13262SS 


(U58734) weak similarity to 
ankyrin G [Caenorhabditis 
elegans] 


0.12 
667 


X59552 


G.domesticus mRNA 
for ventricular myosin 
heavy chain 


0.065 


2497098 


HYPOTHETICAL 74.2 KD 
PROTEIN IN AMD1-RAD52 
INTERGENIC REGION 
>gi|1077180|pir||S49745 
probable membrane protein 
YML034w - yeast 
(Saccharomyces cerevisiae) 
>gi|575685 (Z46659) unknown 
orf, len: 656, CAI: 0.13 
[Saccharomyces cerevisiae] 


0.014 


668 


M72980 


Anthonomus grandis 
vitellogenin gene 
(VTG), complete cds. 


0.065 


3242750 


(AC005164) match to ESTs 
AA731149 (NID:g2140138), 
AA731908 (NID:g2752719), 
AA287837 (NID:g1933519), 
AA262811 (NID:g1898382), 
and AA825820 (NID:g2899132) 


5e-33 


669 


AB014546 


Homo sapiens mRNA 
for KIAA0646 
protein, complete cds 


0.064 


<NONE> 


<NONE> 


<NONE> 


670 


M30039 


Sheeppox virus strain 
KS-1 ORF HM1 
gene, partial cds; 
ORF HM2 and ORF 
HM3 genes, complete 
cds, and ORF HM4 
gene, partial cds 


0.064 


<NONE> 


<NONE> 


<NONE> 


671 


Z68013 


Caenorhabditis 
elegans cosmid 
W02H3, complete 
sequence 
[Caenorhabditis 
elegans] 


0.064 


<NONE> 


<NONE> 


<NONE> 


672 


ARM 1332 


Bodo saltans 
unknown mRNA, 
kinetoplast gene 
encoding kinetoplast 
protein, complete cds 


0.064 


<NONE> 


<NONE> 


<NONE> 


673 


J00451 


Mouse germline IgG- 
3 chain gene, D-J-C 
region, and switch 
region. 


0.064 


<NONE> 


<NONE> 


<NONE> 
674 


U41289 


Dictyostelium 
discoideum K7 
kinesin-like protein 
mRNA, complete cds 


0.064 


3482972 


(AL031369) putative protein 


9.3 


675 


M37395 


L.lactis (strain SK11) 
proteinase plasmid 
pSK111 DNA, 
complete cds. 


0.064 


993019 


(X87246) alternative start codon 
[Pseudorabies virus] 


9.2 


676 


Z15030 


H.sapiens gene for 
ventricular myosin 
light chain 2 > :: 
gb|L01652|HUMVM 
LC Human 
ventricular myosin 
light chain 2 gene, 
seven exons. 


0.064 


730343 


PROLACTIN RECEPTOR 
PRECURSOR (PRL-R) mouse 
>gi|220576|gnl|PID|d1001535 
(D10214) prolactin receptor 
precursor [Mus musculus] 
>gi|293770 (L14811) prolactin 
receptor [Mus musculus] 
>gi|347842 (L13593) prolactin 
receptor [Mus musculus] 
receptor:ISOTYPE=long form 
[Mus musculus] 


9.1 


677 


Z12021 


G.max gene for 
catalase 


0.064 


2498711 


ORIGIN RECOGNITION 
COMPLEX PROTEIN, 
SUBUNIT 2 >gi|1185461 
(U38472) essential ORC2- 
related fission replication factor 
Orp2 [Schizosaccharomyces 
pombe] 


5.3 


678 


L05668 


Entamoeba histolytica 
protein 

serine/threonine 
kinase (pstkl) gene, 
complete cds. 


0.064 


733140 


(U22453) carboxypeptidase 
[Simulium vittatum] 


5.3 
679 


U50715 


Mus musculus alpha- 
galactosidase A gene, 
complete cds 


0.064 


125398 


HYGROMYCIN-B KINASE 
(HYGROMYCIN B 
PHOSPHOTRANSFERASE) 
(APH(7")) 

>gi|66885|pir||WGSMHH 
hygromycin B 

phosphotransferase (EC 2.7.1.-) - 
Streptomyces hygroscopicus 
>gi|581682 (X03615) pot. hyg 
protein [Streptomyces 
hygroscopicus] 
phosphotransferase [synthetic 
construct] >gi|2739064 cloning 
vector] >gi|2739068 
(AF025747) hygromycin B 
phosphotransferase [unidentified 
cloning vector] 


2.3 


680 


Z28182 


S.cerevisiae 
chromosome XI 
reading frame ORF 
YKL182w 


0.064 


1079035 


Om(2D) protein - fruit fly 
(Drosophila ananassae) 
>gi|443770|gnl|PID|d1006095 
(D26553) ORF 


1.8 


681 


M29917 


Human ornithine 
aminotransferase 
gene, exon 1. 


0.064 


2317934 


(U97553) unknown [murine 
herpesvirus 68] 


1.4 


682 


AB020709 


Homo sapiens mRNA 
for KIAA0902 
protein, complete cds 


0.064 


861404 


(U29154) T07F12.3 gene 
product [Caenorhabditis 
elegans] 


0.47 


683 


AB014546 


Homo sapiens mRNA 
for KIAA0646 
protein, complete cds 


0.064 


1708118 


HOMEOBOX PROTEIN HB9 
>gi|507425 


0.35 


684 


AB010427 


Homo sapiens mRNA 
for NORM, complete 
cds 


0.064 


2383676 


(AF015539) precollagen P 
[Mytilus edulis] 


0.018 


685 


U34774 


Orf virus ankyrin-like 
repeat protein, F11L 
homolog, and F12L 
homolog genes, 
complete cds. 


0.064 


731668 


SSF1 PROTEIN 
>gi|626624|pir||S46700 SSF1 
protein - yeast (Saccharomyces 
cerevisiae) 


1e-05 


686 


AF022361 


Mus musculus 
neuropilin-2(a5) 
mRNA, alternatively 
spliced, complete cds 


0.064 


4091978 


(AF073359) benzaldehyde 
dehydrogenase [Pseudomonas 
sp. DJ77] 


1e-05 



687 


U14331 


Sus scrofa myogenin 
gene, complete cds 


0.064 


2781386 


(AC004010) similar to Leucine- 
rich transmembrane proteins; 
44% similarity to U42767 
(PID:g1736918) [Homo 
sapiens] 


3e-33 


688 


AF074870 


Chironomus 
pallidivittatus clone 
1219 non-telomeric 
Ssp repeat sequence 


0.063 


<NONE> 


<NONE> 


<NONE> 


689 


Z25523 


H.sapiens repeat 
region DNA. 


0.063 


<NONE> 


<NONE> 


<NONE> 


690 


AE001378 


Plasmodium 
falciparum 
chromosome 2, 
section 15 of 73 of 
the complete 
sequence 


0.063 


<NONE> 


<NONE> 


<NONE> 


691 


Z72947 


S.cerevisiae 
chromosome VII 
reading frame ORF 
YGR162w 


0.063 


<NONE> 


<NONE> 


<NONE> 


692 


Y14723 


Choanomphalus 
incertus 
mitochondrial 
cytochrome c oxidase 
subunit 1 gene, partial 


0.063 


<NONE> 


<NONE> 


<NONE> 


693 


X74103 


Streptomyces sp. 
gene for alkaline 
serine protease 1 


0.063 


1730713 


HYPOTHETICAL m.t> KD 
PROTEIN IN UME3-PUB1 
INTERGENIC REGION 
>gi|2131866|pir||S62935 
hypothetical protein YNL023c - 
yeast (Saccharomyces 
cerevisiae) 
>gi|1301855|gnl|PID|e239870 
(Z71299) ORF YNL023c 
[Saccharomyces cerevisiae] 


6.7 


694 


AF039843 


Homo sapiens 
Sprouty 2 (SPRY2) 
mRNA, complete cds 


0.063 


232217 


GLUTATHIONE S- 
TRANSFERASE GST-6.0 
(GST B1-1) 
>gi|421198|pir||S29772 
glutathione transferase (EC 
2.5.1.18) - Proteus mirabilis 
>gi|2126142|pir||S71882 
glutathione transferase (EC 
2.5.1.18) B - Proteus mirabilis 
>gi|1053076 (U38482) 


5.2 
695 


M63650 


Mouse M-twist gene 
mRNA, complete cds. 


0.063 


1730141 


FRAGILE X MENTAL 
RETARDATION SYNDROME 
RELATED PROTEIN 2 
>gi|2135129|pir||S60173 fragile 
X mental retardation syndrome 
related protein - human 
>gi|1098637 (U31501) fragile X 
mental retardation syndrome 
related protein [Homo sapiens] 


1.8 


696 


Y13298 


Homo sapiens GDP 
dissociation inhibitor 
beta pseudogene 


0.063 


1085930 


hypothetical protein 4 - fowl 
adenovirus 1 


1.3 


697 


X56600 


Rat SOD-2 gene for 
manganese- 
containing superoxide 
dismutase 


0.063 


3882143 


(AB018254) KIAA0711 protein 
[Homo sapiens] 


0.60 


698 


Z23107 


M.musculus mRNA 
for 5HTx serotonin 
receptor 


0.063 


1708162 


HUNTINGTIN 
(HUNTINGTON'S DISEASE 
PROTEIN HOMOLOG) (HD 
PROTEIN) 


0.45 


699 


M20670 


Plasmodium vivax 
circumsporozoite 
protein gene, 3' end. 


0.063 


4033395 


DNA GYRASE SUBUNIT B 
subunit [Myxococcus xanthus] 


0.35 


700 


Z62997 


H.sapiens CpG DNA, 
clone 76g11, reverse 
read cpg76g11.rt1a. 


0.063 


1350911 


RETINOIC ACID RECEPTOR 
RXR-BETA sapiens] 
>gi|3172498 (AF065396) 
retinoic X receptor B 
dJ1033B10.11 (Retinoid X 
receptor beta (RXRB)) [Homo 
sapiens] >gi|4249766 
(AF120161) retinoic X receptor 
beta 


0.16 


701 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.063 


2981200 


(AF048732) cyclin T2b [Homo 
sapiens] 


0.090 


702 


U95098 


Xenopus laevis 
mitotic 

?hosphoprotein 44 
mRNA, partial cds 


0.063 


3877951 


(ZS1555) predicted using 
Genefinder


6e-07 


703 


U95094 


Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds 


0.063 


33930 IS 


(AL03U74) hypothetical 
protein 


2e-10






704


D90872


E.coli genomic DNA,
Kohara clone
#419(54.7-55.1 min)


0.063 


2498198 


CYTOCHROME B561 
(CYTOCHROME B-561) 


3e-19


705 


M25528 


M.crystallinum 
ferredoxin-NADP+ 
reductase (fnrA)
mRNA, complete cds.


0.062


<NONE> 


<NONE> 


<NONE> 


706 


U45256 


Strongyloides ratti 
microsatellite B DNA 


0.062 


<NONE> 


<NONE> 


<NONE> 


707 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 90 
mRNA, complete cds 


0.062 


<NONE> 


<NONE> 


<NONE> 


708 


AF044317 


Homo sapiens 
TEL/AML1 fusion
gene, partial sequence


0.062 


<NONE> 


<NONE> 


<NONE> 


709 


Z73975 


Caenorhabditis 
elegans cosmid
T06E8, complete
sequence
[Caenorhabditis
elegans]


0.062 


3108187 


(AC004663) Notch 3 [Homo 
sapiens] 


2.9 


710 


X54232 


Human mRNA for 
heparan sulfate
proteoglycan


0.062 


1076741 


chitinase (EC 3.2.1.14) 
precursor - rice precursor - rice 
>gi|S07955 (X87109) chitinase 
[Oryza sativa]


0.59 


711


X03073


bovine retinal mRNA
for transducin beta-
subunit


0.062 


477578 


sialidase - Actinomyces viscosus 
>gi|141852 


0.087 


712 


Y12573


D.melanogaster Jun
and 14-3-3 zeta gene


0.062 


3879551 


(Z70756) similar to collagen 


0.073 


713 


L26573 


Bombus terrestris
mitochondrial
cytochrome oxidase I,
partial cds.


0.062 


1684959 


(U20600) NADH 
dehydrogenase subunit [Vanda 
lamellata] 


0.039 
714


U58994


Human ladinin (LAD)
gene, complete cds


0.062


2811078


AMINOPEPTIDASE B
(ARGINYL
AMINOPEPTIDASE)
(ARGININE
AMINOPEPTIDASE)
(CYTOSOL
AMINOPEPTIDASE IV) (AP-
B) >gi|2039143 (U61696)
aminopeptidase B [Rattus
norvegicus]


9e-06 


715 


AB014553


Homo sapiens mRNA 
for KIAA0653 
protein, partial cds 


0.062 


1326350 


(U58748) similar to potential
transmembrane domains in S.
cerevisiae nuclear division
RFT1 protein (SP:P38206)


5e-10 


716 


L16898


Mus musculus
collagen alpha 1 type
XVIII mRNA, 5' end.


0.062 


1723657 


HYPOTHETICAL 38.5 KD
PROTEIN IN ERV1-GLS2
INTERGENIC REGION
>gi|2132587|pir||S64322
probable membrane protein
YGR031w - yeast
(Saccharomyces cerevisiae)
>gi|1323010|gnl|PID|e243277
(Z72816) ORF YGR031w
[Saccharomyces cerevisiae]


1e-14


717 


X99343 


M. tuberculosis 
guaA/B & choD 
genes 


0.062 


3373807 


(Z49907) B0491.1 
[Caenorhabditis elegans]


2e-19 


718 




Homo sapiens MAD- 
related gene SMAD7 
(SMAD7) mRNA, 
complete cds 


0.061


<NONE>


<NONE>


<NONE>


719 


L10182 


Myrmeleon sp. 18S
ribosomal RNA. 


0.061 


<NONE> 


<NONE> 


<NONE> 


720 


Y14723


Choanomphalus
incertus
mitochondrial
cytochrome c oxidase 
subunit 1 gene, partial 


0.061 


<NONE> 


<NONE> 


<NONE> 


721 


L27840 


Bovine respiratory 
syncytial virus 
nucleoprotein mRNA. 
complete cds. 


0.061 


542955 


nucleoporin p62 - human 


8.6 












Sus scrofa




722 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.061 


494454 


>gi|4944D3|p<tt>|lt'Ufii|B bus 
scrofa Sus scrofa 
>gi|1421210|pdb|1PCP| Porcine
Spasmolytic Protein (Psp) (Nmr, 

!0 n i/*t iiivc^ ^na^molvrif* 

Polypeptide 

>gi|1633061]pdb|2PSP|B Chain 
B, Porcine Pancreatic 
Spasmolytic Polypeptide 


2.9 


723 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 


0.061 


3845272 


(AE001417) hypothetical 
protein [Plasmodium 
falciparum] 


1.3 


724 


U26463 


Sporobolomyces
salmonicolor
NADPH-dependent 
aldehyde reductase 
gene, complete cds 


0.061 


1710288 


(U79302) unknown [Homo
sapiens]


0.44 












725


AF035443


Xenopus laevis wee1
homolog mRNA,
complete cds


0.061


3979720


EMBL:D33048 comes from this
gene; cDNA EST
EMBL:D35780 comes from this
gene; cDNA EST yk442c6.3
comes from this gene; cDNA
EST yk442c6.5 comes from this
gene; cDNA EST yk398f6.3
comes from this gene; cDNA
E...
>gi|3979816|gnl|PID|e1358315
EST EMBL:D35780 comes
from this gene; cDNA EST
yk442c6.3 comes from this
gene; cDNA EST yk442c6.5
comes from this gene; cDNA
EST yk398f6.3 comes from this
gene; cDNA E...


2e-04


726 


Z43584


Caenorhabditis
elegans cosmid
ZK1321, complete
sequence
[Caenorhabditis
elegans]


0.061 


3183491 


HYPOTHETICAL Sltf KD 

PROTEIN C27F2.7 IN
CHROMOSOME III
>gi|1065510 (U40419) C27F2.7
gene product [Caenorhabditis
elegans]


3e-11












727


X61489


Zea mays pep gene
for (C3 type)
phosphoenolpyruvate
carboxylase


0.061


2496887


HYPOTHETICAL 32.0 KD
PROTEIN C09F5.2 IN
CHROMOSOME III
>gi|732538 (U22832) C09F5.2
gene product [Caenorhabditis
elegans]


1e-15


728 


AF025408 


Drosophila 
melanogaster
Windbeutel (wind)
gene, complete cds 


0.061 


3702295 


(AC005783) R33083.1 [Homo 

sapiens] 


2e-60 


729 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds 


0.060 


<NONE> 


<NONE> 


<NONE> 


730 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


0.060 


<NONE> 


<NONE> 


<NONE> 


731 


Y08682 


H.sapiens mRNA for
carnitine 

palmitoyltransferase I 
type I 


0.060 


3319446 


"(AF077541) contains similarity 
to class-I aminoacyl-tRNA
synthetases [Caenorhabditis
elegans]


8.1 


732 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA. 
complete cds 


0.060 


1041119 


(D78016) TRAE [Enterococcus
faecalis]


8.1 


733 


AF064030 


Helianthus tuberosus 
lectin 2 mRNA. 
complete cds 


0.060 


632209 


regulatory protein Rex - primate 
T-lymphotropic virus PTLV-L
(fragment)


3.7 


734 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


0.060 


3098348 


(AF037401) neuropeptide 
Y/peptide YY receptor Yc 
[Danio rerio] 


2.1 


735 


U95102


Xenopus laevis
mitotic
phosphoprotein 90
mRNA, complete cds


0.060 


125978 


(LEUKOCYTE ANTIGEN 
RELATED) 

>gi|70t46|pir]|TpHULK 
leukocyte antigen-related 
protein precursor - human 
>iii|34267 sapiens] 


1.2 


736 


U76523 


Sambucus nigra lectin
precursor mRNA,
complete cds 


0.060 


2055394 


(U87306) transmembrane 
receptor UNC5H2 [Rattus 
norvegicus]


0.32 


737 


U69668 


Human nuclear pore
complex-associated 
protein TPR 


0.060 


4127854 


(Y14063) ChT1 thymocyte
antigen [Gallus gallus] 


9e-04 
SEQ
ID 



742 



744 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



738 | AB014553



746 



P VALUE 



Nearest Neighbor f BlastX vs. Non-Redundant Proteins ) 



ACCESSION 



Homo sapiens mRNA
for KIAA0653 
protein, partial cds 



739 | U95098 



Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 



DESCRIPTION 



0.060 



1326350 



0.060 



740 | Z96260



H.sapiens telomeric 
DNA sequence, clone 
12QTEL 101, read 
12QTELOO101.seq 



741 | M93128



Mouse homeobox 
protein (EVX2) 
mRNA, complete cds 



[Brassica rapa mRNA 
for SRK45, complete 
AB012106 cds 



Homo sapiens mRNA 
for KIAA0451
743 | AB007920 | protein, complete cds



2632098 



0.059 



<NONE> 



0.059 



<NONE> 



(U58748) similar to potential



P VALUE 



transmembrane domains in S. 
cerevisiae nulcear division 
RFT1 protein (SP:P38206) 



(Y15513) Prodos protein 
[Drosophila melanogaster] 



<NONE> 



0.059 



1652318 



Human histidine 
decarboxylase (HDC) 
M 60445 mRNA, complete cds 



745 U01836 



Ustilago maydis
exodeoxyribonuclease
(REC1) gene,
complete cds.



0.059 



479491 



0.058 



<NONE> 



<NONE> 



(D90904) lysostaphin
[Synechocystis sp.] 



1e-09



5e-10



<NONE> 



<NONE> 



transcription factor brn-3b 
human 



<NONE> 



Lycopersicon
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA. 
AF090115 complete cds



0.058 



1171908 



747 



Brassica rapa mRNA 
for SLG45, complete 
AB012105 cds



OLIGOPEPTIDE
TRANSPORT SYSTEM
PERMEASE PROTEIN OPPC
>gi|1075086|pir||D64184
oligopeptide transport system
permease protein (oppC)
homolog - Haemophilus
influenzae (strain Rd KW20)
permease protein (oppC)
[Haemophilus influenzae Rd]



4.7 



0.71 



<NONE> 



0.053 



0.057 



3193265 



4333S5 



(AF069131) chitinase [Bacillus 
subtilis]



(U03973) dynein heavy chain 
isotype 7A [Tripneustes gratilla] 



1.5 



0.002 



3.4 






748


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


0.056 


<NONE> 


<NONE> 


<NONE> 


749 


Y16828


Lagopus lagopus
genomic 
microsatellite 
sequence, LLST4 


0.056 


3328678 


(AE001299) hypothetical 
protein [Chlamydia trachomatis] 


4.3 


750 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.055 


<NONE> 


<NONE> 


<NONE> 


751 


AF074385 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds


0.055 


137339 


69 KD PROTEIN 
>gi|94375|pir||S19150
hypothetical protein, 69K - 
turnip yellow mosaic virus 


0.69 


752 


U76523 


Sambucus nigra lectin
precursor mRNA,
complete cds 


0.035 


<NONE> 


<NONE> 


<NONE> 


753 


M92069 


Human retrovirus-like 
sequence-isoleucine c


0.034 


<NONE> 


<NONE> 


<NONE> 


754 


S78516 


G1L=ankyrin-like
repeat [orf virus OV. 
NZ2, Genomic. 160S 
nt] 


0.033 


2804465 


(AF043700) contains similarity 
to human RNA-binding protein 
FUS/TLS (SW:Q2S009) 
[Caenorhabditis elegans]


0.15 


755 


M15646


Chicken myosin 
alkali light chain 
mRNA, complete cds, 
clone pFl. 


0.027 


3334221 


4-HYDROXYPHENYLPYRUVATE
DIOXYGENASE 4-
hydroxyphenylpyruvate
dioxygenase [Mycosphaerella
graminicola]


6e-17 


756 


AF027174 


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds 


0.025 


3877815 


(Z96048) predicted using 
Genefinder 


5.0 
















757 


AJ002291 


Streptococcus 
pneumoniae pbp1b
gene, complete 


0.025 


3880437 


similar to ribose-
phosphate pyrophosphokinase;
cDNA EST EMBL:D73173
comes from this gene; cDNA
EST EMBL:D70909 comes
from this gene; cDNA EST
EMBL:D73449 comes from this
gene; cDNA EST
EMBL:D76167 comes from this
ge...


1.7 


758 


X79104 


C.botulinum (NCTC
7272 type A) HA-33
and P-21 genes.


0.024 


2648615 


(AE000970) tungsten 
formylmethanofuran
dehydrogenase, subunit B (fwdB
2) [Archaeoglobus fulgidus]


6.1 


759 


U95102


Xenopus laevis 
mitotic 

phosphoprotein 90 
mRNA. complete cds 


0.024 


1663698 


(D83785) expressed 
ubiquitously; product similar to 
D.melanogaster mam protein.
[Homo sapiens] 


4.7 


760 


U36197 


Chlamydomonas 
reinhardtii cobalamin- 
independent 
methionine synthase 
mRNA. complete cds 


0.024 


585723 


PEROXISOME 
PROLIFERATOR 
ACTIVATED RECEPTOR 
GAMMA (PPAR-GAMMA) 
>gi|2S38iS|pir||C42214 
Peroxisome proliferator-
activated receptor gamma chain -
African clawed frog >gi|214668
(M84163) peroxisome
proliferator activated receptor
gamma [Xenopus laevis]


0.42 


761 


L38865 


Macaca mulatta 
(clone MMVA63) T- 
cell receptor alpha 
(TCR A) mRNA. 
partial cds. 


0.023 


<NONE> 


<NONE> 


<NONE> 


762 


AF035948 


Mus musculus insulin 
receptor substrate-3


0.023 


2500587 


SPLICEOSOME 
ASSOCIATED PROTEIN 49 
spliceosome-associated protein 
SAP-49 - human >gi|556217


0.40 


763 


X98890 


S. tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPT1


0.023 


110072 


proline- rich protein MP4 - 
mouse >gi|53182


0.13 


















YQ 1717 

A7 I HI 


L.esculentum mRNA 
ror HiJ-£Lr protein 






<NONE>


<NONE> 


765 


AC004498 


Homo sapiens 
chromosome 5, P1
clone 1209C1 (LBNL
H104), complete
sequence [Homo
sapiens]


0.022 


<NONE> 


<NONE> 


<NONE> 


766


UU/UOJ 


Human prostatic acid 
phosphatase (ACPP) 
gene, exon 1 


0.022 


<NONE>


<NONE> 


<NONE> 


767 


X98890 


S. tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPT1


0.022 


<NONE> 


<NONE> 


<NONE> 


768 


X56488 


L.esculentum LAT59 
gene 5'flanking 
region, expressed 
during pollen 
maturation 


0.022 


<NONE>


<NONE> 


" 

<NONE> 


769 


M34651


Pseudorabies virus 
with upstream and 
downstream
sequences. 


0.022 


<NONE> 


<NONE> 


<NONE> 


770


Add/27 


P.taeda gene for
protochlorophyllide
reductase


0.022


3878517 


(Z92806) K10G4.4
[Caenorhabditis elegans]


4.3


771 


U95102


Xenopus laevis
mitotic
phosphoprotein 90
mRNA, complete cds


0.022 


1 854452 


' riQO^i^ 1 ^ c • m till* tf\ r i ti t/t r\r 

^ov^ui j similar 10 sauvary 
proline rich protein P-B [Homo 
sapiens] 


4.3 


772 


U95098 


Xenopus laevis 
mitotic 

Phosphoprotein 44 
mRNA. partial cds 


0.022 


3021699 


(AB005298) BAI 2 [Homo 
sapiens] 


0.64


773 


X71932 


H.sapiens XB gene
for tenascin-X, intron
14 


0.022 


627059 


liver stage antigen LSA-1 -
Plasmodium falciparum
>gi|9916 (X56203) liver stage
antigen


0.058 


774 


X87369


C.perfringens nanH
gene & ORF1, 2, 3 & 4


0.022 


2062407 


(U78975) poly(ADP-ribose)
glycohydrolase [Bos taurus]


0.056 















U1 SMALL NUCLEAR




775 


V 1 A CO 1 


Gallus gallus mRNA
tor lvou protein 


ft fy>"> 




RIBONUCLEOPROTEIN 70
KD (U1 SNRNP 70 KD)
>gi|85864|pir||S02016 U1
snRNP 70K protein - African
clawed frog >gi|65179
(X12430) U1 70K [Xenopus


0 032 


776 


AF003133


Caenorhabditis
elegans cosmid
T21E3


U.Uii- 


1709997


DNA REPAIR PROTEIN
RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe]




777 


AF003133


Caenorhabditis 
elegans cosmid 
T21E3 




1709997


DNA REPAIR PROTEIN
RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe]


% no 


778 


U57645 


Human helix-loop-
helix proteins Id-1
(ID-1) and Id-1' (ID-
1') genes, complete
cds


0.021 


<NONE> 


<NONE> 


<NONE> 


779 


U67570 


Methanococcus 
jannaschii section 112 
of 150 of the 
complete genome 


0.021 


<NONE> 


<NONE> 


<NONE> 


780 


L01584


Trypanosoma cruzi
calcium-binding
protein (CUB2. 8) 
gene, complete cds. 


0.021 


<NONE> 


<NONE> 


<NONE> 


781 


L04787 


Borrelia hermsii outer 
membrane lipoprotein 


0.021 


<NONE> 


<NONE> 


<NONE> 


782 


U95094 


Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds 


0.021 


<NONE> 


<NONE> 


<NONE> 


783 


L36890 


Saccharomyces 
cerevisiae 
mitochondrion 
transfer RNA-Thrl 
(tRNA-Thr) gene; 
transfer RNA-Val 
(tRNA-Val) gene; 
oxi2 gene, complete 
cds; ORF2 and origin 
of replication (ori5).


0.021 


<NONE> 


<NONE> 


<NONE> 
















784 


M76741 


Homo sapiens biliary 
glycoprotein (BGP)
gene, partial cds. 


0.021 


<NONE> 


<NONE> 


<NONE> 


785 


M87504 


Tetrahymena 
thermophila histone
H3 (HHT2) gene, 
complete cds 


0.021 


<NONE> 


<NONE> 


<NONE> 


786 


U94346 


Human calpain-like
protease (htra-3) 
mRNA. complete cds 


0.021 


<NONE>


<NONE> 


<NONE> 


787 


L01584


Trypanosoma cruzi
calcium-binding
protein (CUB2.3) 
gene, complete cds. 


0.021 


<NONE> 


<NONE> 


<NONE> 


788 


U36530 


Pongo pygmaeus C i
microsatellite, clone
#i, from the tandemly
repeated genes 
encoding U2 small 
nuclear RNA (RNU2 
locus ) 


0.021 


<NONE> 


<NONE> 


<NONE> 


789 


X03833 


Human gene for 
interleukin 1 alpha
(IL-1 alpha)


0.021 


416974 


EARLY TRANSCRIPTION 
FACTOR 70 KD SUBUNIT


8.9 


790 


U20806 


Dictyostelium 
discoideum guanine 
nucleotide-binding 
protein alpha subunit 
5 (G alpha 5) gene, 
complete cds. 


0.021 


1401211 


(U58510) RNA helicase
homolog [Chlorarachnion
CCMP621]


8.8 


791 


Z59258 


H.sapiens CpG DNA. 
clone 13d2, reverse
read cpg13d2.rt1c.


0.021 


3121732 


ACONITATE HYDRATASE
(CITRATE HYDRO-LYASE)
(ACONITASE) >gi|2183256
(AF002133) aconitase 
[Mycobacterium avium] 


7.0 


792 


AF030692 


Plasmodium 
falciparum strain 7G8
chloroquine 
resistance candidate 
protein (cg2) gene, 
complete cds 


0.021 


3024190 


NINE PROTEIN
>gi|2120251|pir||S66581
hypothetical protein 56 - phage 
S2 >gi|105H14(X92588) 
orf56; related to nin60 (ninE) of 
bacteriophage lambda 


5.8 


793 


U67570


Methanococcus
jannaschii section 112
of 150 of the
complete genome 


0.021 


2341037 


(AC000104) F19P19.17
[Arabidopsis thaliana]


4.0 












794


D86566


Human DNA for
NOTCH4, partial cds


0.021


1708619


NUCLEAR FACTOR NF-
KAPPA-B P100 SUBUNIT
(H2TF1) (ONCOGENE LYT-
10) (LYT10) [CONTAINS:
NUCLEAR FACTOR NF-
KAPPA-B P52 SUBUNIT]


3.1 


795 


L11648


Streptomyces 
coelicolor sigma 
factor (rpoX) gene, 
complete cds. 


0.021 


79833 


hypothetical 1 19.5K protein 
(uvrA region) - Micrococcus 
luteus 


1.8 


796 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA. 
complete cds 


0.021 


128000 


CONVERTASE 1
PRECURSOR (NEC 1) (PC1)
(PROHORMONE
CONVERTASE 1) propeptide
processing protease [Mus
cookii]


1.0 


797 


U30938 


Rattus norvegicus 
microtubule- 
associated protein 2 


0.021 


468600 


(X74416) beta-3 integrin
[Takifugu rubripes] 


1.0 


798 


D82364 


Chicken mRNA for 
TSC-22 variant, 
complete cds, clone 
SLFEST52 


0.021 


693723 


27 kda amelogenin 
{alternatively spliced}


0.61 


799 


U40041 


Gallus gallus eHAND
mRNA, complete cds


0.021 


3449308 


(AB011541) MEGF8 [Homo
sapiens] 


0.21 


800 


X71932 


H.sapiens XB gene
for tenascin-X, intron 
14 


0.021 


627059 


liver stage antigen LSA-1 -
Plasmodium falciparum
>gi|9916 (X56203) liver stage
antigen


0.054 


801 


AF042333 


Oryza sativa 24-
methylene lophenol
C24(1)methyltransferase
mRNA, complete
cds


0.021


854065 


(X83413)US8 [Human 
herpesvirus 6] 


0.014 


802 


L37380 


Rat apical endosomal 
glycoprotein mRNA, 
complete cds. 


0.021 


3334377 


TRANSMEMBRANE 
PROTEASE, SERINE 2


1e-05




803 


AF003133 


Caenorhabditis 
elegans cosmid 
T21E3 


0.021 


1709997 


DNA REPAIR PROTEIN 
RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe] 


3e-0S 






804


X57689


Rabbit mRNA for
calcium channel BI-2
(lambda CBP109 and
CB101)


0.021


2959370 


(AL022117) hypothetical 
protein 


1e-10


805 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 90 
mRNA. complete cds 


0.021 


1109830


(U41534) coded for by C. 
elegans ci>r*A (_fcfcM42r; 
Similar to helicases of 
SNF2/RAD54 family. 
[Caenorhabditis elegans]


5e-11


806 


X77753 


H.sapiens TROP-2
gene 


0.021 


1723657 


HYPOTHETICAL 38.5 KD
PROTEIN IN ERV1-GLS2
INTERGENIC REGION
>gi|2132587|pir||S64322
probable membrane protein
YGR031w - yeast
(Saccharomyces cerevisiae)
>gi|1323010|gnl|PID|e243277
(Z72816) ORF YGR031w
[Saccharomyces cerevisiae]


5e-11


807 


X98890 


S. tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPTl 


0.021 


2137872 


zinc finger protein PZF - mouse

>gi|453376 


2e-19


808 


AF027173


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete
cds


0.020


<NONE> 


<NONE> 


<NONE> 


809 


AJ224935 


Homo sapiens 
Promotor Region and
PCJC2 gene


0.020 


<NONE> 


<NONE> 


<NONE> 


810 


U76524 


Sambucus nigra
ribosome inactivating 
protein precursor 
mRNA. complete cds 


0.020 


<NONE> 


<NONE> 


<NONE> 


811 


X99941 


A.thaliana GBF1
gene 


0.020 


<NONE> 


<NONE> 


<NONE> 


812 


X65138 


M.musculus mRNA 
for tyrosine kinase > 
:: gb|S57l6S|S57l6S 
Sek=Eph- related 
receptor protein 
tyrosine kinase [mice, 
mRNA, 4242 nt] 


0.020


<NONE> 


<NONE>


<NONE> 
















813 


L04787 


Borrelia hermsii outer 
membrane lipoprotein 


0.020 


<NONE> 


<NONE> 


<NONE> 


814 


AJ223633 


Enterococcus faecium
genes encoding 
enterocin L50A and 
enterocin L50B plus 
5' and 3' flanking 
regions 


0.020 


<NONE> 


<NONE> 


<NONE> 


815 


AB012106


Brassica rapa mRNA
for SRK45, complete 
cds 


0.020 


<NONE> 


<NONE> 


<NONE> 


816 


AE001539 


Helicobacter pylori, 
strain J99 section 100 
of 132 of the 
complete genome 


0.020 


172292 


(L11895) transmembrane
protein [Saccharomyces 
cerevisiae] 


8.4 


817 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA. complete cds 


0.020 


94173 


pol polyprotein - Chinese 
hamster intracisternal A-particle
CHIAP34 


8.0 


818 


M55264 


Herpesvirus saimiri 
dihydrofolate 
reductase (DHFR) 
and snRNA (HSUR) 
genes, complete cds. 


0.020 


2924250 


(Z98745) dJ29Ki.2 [Homo 
sapiens] 


6.5 


819 


AF052163 


Homo sapiens clone 
24456 rnRNA 
sequence 


0.020 


1706288 


D(4) DOPAMINE RECEPTOR
(D(2C) DOPAMINE
RECEPTOR)
>gi|2H9482|pir||I49246 D4
dopamine receptor - mouse
>gi|758427 (U19880) D4
dopamine receptor [Mus
musculus]
>gi|1095539|prf||2109259A
dopamine D4 receptor [Mus
musculus]


4.9 


820 


AF074387 


Sambucus nigra 
hevein-like protein
mRNA, complete cds 


0.020 


2113793 


(Z83259) AmphiBrD8
[Branchiostoma floridae] 


4.7 


821 


AF052163 


Homo sapiens clone 
24456 mRNA 
sequence 


0.020 


3874733 


(Z67754) cDNA EST 
EMBL:T02354 comes from this
gene; cDNA EST
EMBL:D32698 comes from this
gene; cDNA EST
EMBL:D35411 comes from this
gene


4.7 
















822 


L11002


Rat ankyrin binding
glycoprotein-1 related
mRNA sequence. 


0.020 


552132 


(K01664) Bkm-like protein
[Drosophila melanogaster] 


3.8 


823 


AE001539


Helicobacter pylori, 
strain J99 section 100 
of 132 of the 
complete genome 


0.020 


172292 


(L11895) transmembrane
protein [Saccharomyces 
cerevisiae] 


3.8


824 


X98890 


S. tuberosum mRNA
for inorganic
phosphate
transporter, StPT1


0.020 


3879798 


l-Lorrrvj-smtrMrTtriri*" ■ — 
Domain (2 domains); cDNA
EST yk390b10.3 comes from
this gene; cDNA EST
EMBL:D71652 comes from this
gene; cDNA EST yk275f8.3
comes from this gene; cDNA
EST yk393b9.3 comes from this
gene; cDNA EST yk37...
>gi|3880220|gnl|PID|e1349842
yk390b10.3 comes from this
gene; cDNA EST
EMBL:D71652 comes from this
gene; cDNA EST yk275f8.3
comes from this gene; cDNA
EST yk393b9.3 comes from this
gene; cDNA EST yk37...


1.3 


825 


U97519 


Homo sapiens 
podocalyxin-like
protein mRNA,
complete cds 


0.020 


1345633 


C-1-TETRAHYDROFOLATE
SYNTHASE, CYTOPLASMIC
(C1-THF SYNTHASE)
(METHYLENETETRAHYDROFOLATE
DEHYDROGENASE /
METHENYLTETRAHYDROFOLATE
CYCLOHYDROLASE
C1-tetrahydrofolate synthase
[Rattus norvegicus]


0.066 


826 


AF003133


Caenorhabditis
elegans cosmid
T21E3


0.020 


1709997 


DNA REPAIR PROTEIN 
RAD18 >gi|1150622 protein
rad18 [Schizosaccharomyces
pombe]


2e-07 


827 


U32857 


Saccharomyces 
cerevisiae VAR1
gene, mitochondrial
gene encoding
mitochondrial
protein, 3' processing
site, partial sequence


0.019


<NONE>


<NONE> 


<NONE> 



SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



828 AF027174 



Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete
cds 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



0.019 



2506381 



829 AF034099 



Laccaria bicolor 
glyoxal malate 
synthase protein 
mRNA, complete cds



830 



Mus musculus
Pontin52 mRNA, 
AF100694 complete cds 



831 



Human RP1 and
complement C4B
precursor (C4B)
U24578 genes, partial cds.



0.019 



3880930 



0.018 



<NONE> 



832 U76523 



Sambucus nigra lectin
precursor mRNA, 
complete cds 



833 U57649 



834 1 X 15642 



Dibenzofuran-
degrading bacterium
DPO360 2,3-
dihydroxybiphenyl
1,2-dioxygenase
(bphC) gene, 
complete cds and 2- 
hydroxy-6-oxo-6- 
phenylhexa-2.4- 
dienoic acid 
hydrolase 



0.013 



473673 



0.011 



<NONE> 



0.011 



835 X51623 



836 US3656 



Z.mays gene for
phosphoenolpyruvate
carboxylase 



<NONE> 



P VALUE 



NEUROGENIC LOCUS
NOTCH HOMOLOG
PROTEIN 4 PRECURSOR
(TRANSFORMING PROTEIN
INT-3) mammary gene mRNA,
complete cds. J, gene product
[Mus musculus] | 3.3

~" similar to 

Phosphoglucomutase and 

phosphomannomutase 

phosphoserine; cDNA EST 

EMBL:D36 168 comes from this 

gene; cDNA EST 

EMBL:D70697 comes from this 

gene; cDNA EST yk373h9.5 

comes from this gene; cDNA 

EST EMBL:T008... \ 6e-15 



<NONE> 



<NONE> 



proline -rich protein precursor 
kidney bean vulgaris] 



3.1 



<NONE> 



<NONE> 



<NONE> 



0.011



C.elegans collagen 
gene col-13



Ratius norvegicus NT- 
KB gene, promoter 
region 



<NONE> 



<NONE> 



0.010 



1695686 



0.008 



4240195 



(D83706) pyruvate carboxylase 
[Bacillus stearothermophilus]



(AB020660) KIAA0853 protein 
[Homo sapiens] 



<NONE> 



<NONE> 



3.1 



10.0 















837


AJ222657


Homo sapiens gene
encoding retina-
specific guanylyl
cyclase


0.008


417704


POL POLYPROTEIN
(ORF 1A/1B) [CONTAINS:
RNA-DIRECTED RNA
POLYMERASE; HELICASE;
PROTEASE]


7.4 


838 


AB012106


Brassica rapa mRNA 
for SRK45, complete 
cds 


0.008 


544024 


CHLORIDE CHANNEL
PROTEIN, SKELETAL
MUSCLE (CHLORIDE
CHANNEL PROTEIN 1) (CLC-
1) human >gi|397143 (Z25587)
human ClC-1 muscle chloride
channel [Homo sapiens]
>gi|398161 (Z25884) human
ClC-1 muscle chloride channel
[Homo sapiens]


4.6 


839 


U76524


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


0.008




(U 13643) similar to reverse 
transcriptase; possible 
pseudogene [Caenorhabditis 
elegans]


i a 


840 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds




^JUl 101/ 


(AF002539) cytochrome 
oxidase I [Austrofundulus 
limnaeus] 




841 


AF074385 


Sambucus nigra
hevein-like protein
mRNA, complete cds


0.003 


1711520 


SRB-S/9 PROTEIN 
>gi|1334996


1.6 


842 


U48734 


Human non-muscle 
alpha- actinin mRNA, 
complete cds 


0.008


2829922 


(AC002291) extensin
[Arabidopsis thaliana]


0.11 


843 


U66669 


Homo sapiens 3-
hydroxyisobutyryl-
coenzyme A 
hydrolase mRNA, 
complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


844 


D16492


Mouse mRNA for
P100 serine protease
of Ra-reactive factor
(RaRF), complete cds


0.007 


<NONE> 


<NONE> 


<NONE> 






845


D90923


Human
immunodeficiency
virus type 1 proviral
DNA for envelope
glycoprotein, partial
cds, isolate 03S


0.007 


<NONE> 


<NONE> 


<NONE> 


846 


AB011087


Homo sapiens mRNA
for KIAA0515
protein, partial cds


0.007 


<NONE> 


<NONE> 


<NONE> 


847 


AE000688 


Aquifex aeolicus 
section 20 of 109 of 
the complete genome 


0.007


<NONE> 


<NONE> 


<NONE> 


848 


X63723 


B.bovis WC1.1 
mRNA 


0.007 


<NONE> 


<NONE>


<NONE> 


849 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA. complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


850 


J00097 


Human beta globin 
region Alu repetitive 
sequence type T. 


0.007 


<NONE> 


<NONE> 


<NONE> 


851 


D90923 


Human 

immunodeficiency 
virus type 1 proviral 
DNA for envelope 
glycoprotein, partial 
cds. isolate 03S 


0.007 


<NONE> 


<NONE> 


<NONE> 


852 


U95094 


Xenopus laevis XL- 
INCENP (XL- 
INCENP) mRNA, 
complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


853 


X91618


T.castaneum
hunchback gene


0.007 


<NONE> 


<NONE> 


<NONE> 


854 


X03838 


Rat nontranscribed 
spacer (NTS) 
downstream of 28S
rRNA gene


0.007 


<NONE> 


<NONE> 


<NONE> 


855 


M55049 


Rattus norvegicus
interleukin-2 receptor
alpha chain (CD25)
mRNA, complete cds.


0.007 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank J 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















856 


Z64318


H.sapiens CpG DNA,
clone 9e2, reverse
read cpg9e2.rt1a.


0.007 


<NONE> 


<NONE> 


<NONE> 


857 


AF027173


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete 
cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


858 


AF027174


Arabidopsis thaliana
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


859 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


860 


X95276 


P.falciparum 
complete gene map of
plastid-like DNA


0.007 


<NONE> 


<NONE> 


<NONE> 


861 


U72396 


Lycopersicon 
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds


0.007 


<NONE> 


<NONE> 


<NONE> 


862 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


863 


AB000383 


Leucania seperata
nuclear polyhedrosis
virus DNA for pi 3.
xe, envelope protein, 
complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


864 


D86566 


Human DNA for 
NOTCH4, partial cds


0.007 


<NONE> 


<NONE> 


<NONE> 


865


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


0.007 


<NONE> 


<NONE>


<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















866 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath 
A) mRNA, complete 
cds 


0.007 


3047072 


(AF058825) No definition line 
found [Arabidopsis thaliana] 


8.9 


867 


AF027174


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


0.007


975754 


(U29359) SpaO [Salmonella 
enterica]


86 


868 


U76524 


Sambucus nigra 
ribosome inactivating
protein precursor 
mRNA, complete cds 


0.007 


1213557 


(U50199) coded for by C.
elegans cDNA yk89e9.5; coded
for by C. elegans cDNA cm7g5;
coded for by C. elegans cDNA
cm14b9; coded for by C.
elegans cDNA yk52g5.5; coded
for by C. elegans cDNA
yk76e5.5; coded for by C.
elegans cDNA yk131f11.5; c...


8.4 


869 


AB012106 


Brassica rapa mRNA 
for SRK45, complete 
cds 


0.007 


2499568 


PROTEIN-L-
ISOASPARTATE(D-
ASPARTATE) O-
METHYLTRANSFERASE
(PROTEIN-BETA-
ASPARTATE
METHYLTRANSFERASE)
(PIMT) (PROTEIN L-
ISOASPARTYL/D-
ASPARTYL
METHYLTRANSFERASE)
methyltransferase [Drosophila
melanogaster] >gi|1171337
melanogaster]


8.3 


870 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.007 


4092077 


(AF095353) toll-like receptor 4 
mutant [Mus musculus]


6.2 


871 


AF074386


Sambucus nigra
hevein-like protein
mRNA, complete cds


0.007 


151377 


(M80653) tetraheme
[Pseudomonas stutzeri] 


6.2 


872 


L42319 


Bos taurus (clone 
Sal3.8) tristetraprolin


0.007 


2507337 


TRANSCRIPTION 
TERMINATION FACTOR 
RHO 


55 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















873 


M59815 


Human complement
component C4A
gene, exons 10
through 41.


0.007 



3876769 


(Zt>y63/) Similarity to Human 
Prolyl 4-hydroxylase alpha 
subunit (SW:P4HA_HUMAN); 
cDNA EST yk219gl2.5 comes 
from this gene; cDNA EST 

vkl IQdR ^ rnmfc frrwri thii 

gene; cDNA EST yk339d11.5
comes from this gene; cDNA
EST yk371c9.3...


5.3 


874 


X63723 


B.bovis WC1.1
mRNA


0.007




(AJ001858) human SIM2 
[Homo sapiens]


s i 


875 


AB009864 


Expression vector 
pMElSS-FL3, 


0.007




p45 NF-E2 related factor 2 - 
mouse musculus]


5.1


876 


U76524 


Sambucus nigra 
ribosome inactivating 

protein precursor

mRNA, complete cds 


0.007 


2804497 


(AF043705) contains similarity
to C2H2-type zinc fingers 


5.0 


877 


U95102 


Xenopus laevis
mitotic 

phosphoprotein 90 
mRNA. complete cds 


0.007 


440298 


(L27469) product of alternative 
splicing [Drosophila 
melanogaster) 


4.7 


878 


X58869 


Chicken mRNA for 

aldehyde

dehydrogenase 


0.007 


1185062


(L75945) flagellar export
protein [Borrelia burgdorferi]


4.1 


879 


AF027735 


Nephila clavipes
minor ampullate silk
protein MiSp1
mRNA, partial cds 


0.007 


2394390 


(AF017434) pmi-like gene 
product [Methylobacterium 
extorquens] 


4.0 


880 


AF105228


Bos taurus tuftelin
mRNA, complete cds


0.007 


3036809


(AL022373) putative protein


3.9 


881


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


0.007 



2500$ 14 < 


H V PU IHJb 11LAL 6U I &±> 
PROTEIN T27F2.1 IN 
CHROMOSOME V 
>gi|3S803l I|gnl|PlD|el349S55 
BX42 (SW:BX42_DROME); 
cDNA EST EMBL:C07233
comes from this gene; cDNA
EST EMBL:C08532 comes
from this gene; cDNA EST
yk501h10.3 comes from this
gene; cDNA EST yk50ifl.3...


3.8 









WO 01/02568 



PCT/US00/18374 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 




DESCRIPTION


P VALUE


ACCESSION




P VALUE












(U78289) tylactone synthase




882 


X93567 


L.major mRNA for 
beta-tubulin (1404bp)


0.007 


2317862 


modules 4 & 5 [Streptomyces 
fradiae] 


3.0 


883 


AB012106


Brassica rapa mRNA 
for SRK45, complete 
cds 


0.007


3881103 


(AL032646) predicted using 
Genefinder, cDNA EST 
EMBL:D76407 comes from this 
gene; cDNA EST
EMBL:C08999 comes from this 
gene; cDNA EST ykl99bl2.5 
comes from this gene; cDNA 
EST yk282a4.5 comes from this 
gene; cDNA EST EMBL:C0... 


2.7 


884 


AF041056 


Homo sapiens 
WSCR4 gene, exons 
3 and 4 


0.007 


135817 


THROMBIN RECEPTOR 
PRECURSOR human 
>gi|339677 (M62424) thrombin 
receptor [Homo sapiens] 


2.2 


885


AF093268


Rattus norvegicus
homer-1c mRNA,
complete cds 


0.007 


1723513 


HYPOTHETICAL 32.2 KD 
PROTEIN C22E12.04 IN
CHROMOSOME I >gi| 1 220279 
(Z70043 1 ) unknown 


2.1 


886 


M74798 


Hevea brasiliensis 3- 
hydroxy-3- 
methylglutaryl-
coenzyme A 
reductase gene, 3' 
end. 


0.007 


1001282


(D64003) polyA polymerase


1.9 


887 


Z62997 


H.sapiens CpG DNA,
clone 76e11, reverse
read cpg76e11.rt1a.


0.007 


1176532 


H VPOTHETiCaL til. 9 K£> 
PROTEIN C34E10.8 IN
CHROMOSOME III 
>2i|5007_>l (U 10402) weakly 
similar to protein C kinase 
substrate [Caenorhabditis 


1.8 


888 


AF074386 


Sambucus nigra 
hevein-like protein
mRNA. complete cds 


0.007 


2498317 


DVM POLYPROTEIN 
PRECURSOR nematode 
polyprotein antigen precursor 
[Dictyocaulus viviparus]
>gi|158542l|prfl|2l244I4A 
polyprotein antigen/allergen 
[Dictyocaulus viviparus] 


1.2 


889 


L29426


Synechocystis species
(strain PCC 6803)
drgA gene, complete
cds.


0.007 


388227:5 


(AB018320) KIAA0777 protein
[Homo sapiens]


1.1 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














890 


D83329 


Mus musculus DNA 

synthase, complete 
cds 


0.007 


1001741 


(D64004) hypothetical protein 


0.97 


891 


AB012106 


Brassica rapa mRNA
for SRK45, complete
cds


0.007 


1723928 


HYPOTHETICAL 1 1.6 KD 
PROTEIN IN NUT1-ARO2
INTERGENIC REGION
PRECURSOR YGL149w -
yeast (Saccharomyces


0.94 


892 


U 


ribosome inactivating 
protein precursor 
mRNA, complete cds


0.007


121452 


GLUTENIN, HIGH

MOLECULAR WEIGHT 
SUBUNIT 12 PRECURSOR 
>gi|82606|pir||A24266 glutenin
high molecular weight chain 12 
precursor - wheat >gi|21779 


0.79 


893 


AF027173 


Arabidopsis thaliana 

catalytic subunit (Ath- 
A) mRNA, complete 
cds 


0.007 


927287 


(U30294) ORF2 [Prevotella
ruminicola] 


0.35 


894 


Y11918


H.sapiens IMAGE 
cDNA clone 268S1 


0.007 


1055 ISS 


(U40061) contains similarity to 
transmembrane domains like 
those found in sugar transporter 
proteins 


0.26 


895 


L36827 


Mus Musculus
alphaA-crystallin-
binding protein I


0.007 


4063019 


(AF083061) ABC transporter
TliF [Pseudomonas fluorescens]


0.21 


896 


L36827 


Mus Musculus 
alphaA-crystallin-
binding protein I 


0.007 


4063019 


(AF083061) ABC transporter 
TliF [Pseudomonas fluorescens]


0.20 


897 


Z65719 


H.sapiens CpG DNA,
clone 54c10, reverse
read cpg54c10.rt1a.


0.007 


1097307 


HIC-1 gene [Homo sapiens]


0.20 


898 


AF064029 


Helianthus tuberosus
lectin 1 mRNA, 
complete cds 


0.007 


1174915 


UTROPHIN (DYSTROPHIN- 
RELATED PROTEIN 1) 
(DRP1) (DRP)

>gi|284488|pir||S28381 utrophin 
protein) [Homo sapiens] 


0.002 


899 


AF051730 


Mus musculus 
cathepsin S (CatS)
gene, exon 6 


0.007 


1707017 


(U78721) RNA helicase isolog
[Arabidopsis thaliana] 


0.001 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



900 U62398



901 X76341



902 M26215



903 AB007902



Oryctolagus 



cuniculus
gp42/basigin/OX- 
47/HT7 mRNA, 
complete cds. 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 



P VALUE 



M.musculus 
glutathione reductase 
mRNA. 

Rat (lambda 20B0.5) 
M-type 6- 
phosphofructo-2- 
kinase/fructose-2, 6- 
bisphosphatase



Homo sapiens
KIAA0442 mRNA. 
partial cds 



0.007 



0.007 



0.007 



0.007 



904 U93364 



905 AF093268



906 AF100694



907 AF074386 



Lactococcus lactis 
cremoris plasmid
pNZ4000 insertion
sequence IS982
putative transposase 
gene and eps gene 
cluster 
(epsRXABCDEFGHI 
JKL), complete cds 



0.007 



Rattus norvegicus
homer-1c mRNA,
complete cds 



0.006 



Mus musculus 
Pontin52 mRNA, 
complete cds



0.006 



Sambucus nigra
hevein-like protein 
mRNA. complete cds 



0.006 



908 AF027174 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath 
B) mRNA, complete 
cds 



0.006 



909 AJ005813



Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme



0.006 



2370494 



3513303 



3036809 



2662165 



2731377 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



(Z98944) hypothetical protein 



(AC005594) R26984_l [Homo 
sapiens] 



( AL022373) putative protein 
(AB007902) H«0^12 cDNA 
clone for KIAA0442 has a 574- 
bp insertion at position 1474 of 
the sequence of KIAA0442. 
[Homo sapiens] 



2e-04 



8e-07 



6e-15 



2e-17



(U28739) similar to alcohol 
dehydrogenase/ribitol 
dehydrogenase [Caenorhabditis 

elegans]



<NONE> 



<NONE> 



<NONE> 



1e-31



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 














910 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA. complete 
cds 


0.006 


<NONE> 


<NONE> 


<NONE> 


911 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.006 


<NONE> 


<NONE> 


<NONE> 


912 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.006 


<NONE> 


<NONE> 


<NONE> 


913 


AB012106


Brassica rapa mRNA
for SRK45, complete 
cds 


0.006 


<NONE> 


<NONE> 


<NONE> 


914 


AF064029 


Helianthus tuberosus 
lectin 1 mRNA. 
complete cds 


0.006 


<NONE> 


<NONE> 


<NONE> 


915 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.006


<NONE> 


<NONE> 


<NONE> 


916 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.006 


4049856 


(AF063866) ORF MSV064
hypothetical protein 
[Melanoplus sanguinipes 
entomopoxvirus] 


9.6 


917 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.006 


3880536


1 / X '/I /111 nr&ri i/*f nCiflo 
V£.0 — \Ji\j) prCUW-VCU llAlllg 

Genefinder; similar to Lectin C- 
type domain short and long 
forms (2 domains); cDNA EST 

P\'1R1 f^inA^^ rrtm^ from thic 

gene; cDNA EST 
EMBL:C 1-2424 comes from this 
gene: cDNA EST ykl91e7.3 
comes from this 


7.9 


918 


AF012899 


Sambucus nigra
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.006 


3877761 


(Z81552) F56G4.1
[Caenorhabditis elegans]
>gi|3878615|gnl|PID|e1348240
(Z83118) F56G4.1


7.5 


919 


X80289


H.sapiens PTPU
mRNA for protein
tyrosine phosphatase


0.006 


1168791


CATHEPSIN E PRECURSOR
precursor - rabbit >gi|402729
(L08418) procathepsin E


7.4 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE


ACCESSION 


DESCRIPTION 


P VALUE 










DIAL V LGL V LtUOL " " 




920 




Sambucus nigra 
hevein-like protein
mRNA. complete cds 


0.006 


1346371


KINASE, BETA
DIACYLGLYCEROL
KINASE)
>gi|477059|pir||A47744
diacylglycerol kinase (EC
2.7.1.107) beta - rat 90kDa-
diacylglycerol kinase [Rattus


5.5 


921 


U72396 


Lycopersicon 
esculentum class II
small heat shock 
protein Le-HSP17.6 
mRNA. complete cds 


0.006 


2196567 


(D88588) lipoprotein
[Escherichia coli]


4.3 


922 


AF074387 


Sambucus nigra
hevein-like protein 
mRNA. complete cds 


0.006 


2113798 


(Z83259) AmphiBrf38
[Branchiostoma floridae]


4.3 


923 


AB012106


Brassica rapa mRNA 
for SRK45, complete 
cds 


0.006 


1388166 


(U58282) Bowel [Drosophila
melanogaster]


4.3 


924 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA. complete cds 


0.006 


2496785 


HYPOTHETICAL 20.1 KD 
PROTEIN Y4YS 


4.2 


925 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.006 


416592 


A-AGGLUTININ
ATTACHMENT SUBUNIT
PRECURSOR
>gi|101170|pir||A41258 a-
agglutinin core protein AGA1 -
yeast (Saccharomyces
cerevisiae)


2.7


926 


AF064029 


Helianthus tuberosus
lectin 1 mRNA,
complete cds 


0.006 


416592 


A-AGGLUTININ
ATTACHMENT SUBUNIT
PRECURSOR
>gi|101170|pir||A41258 a-
agglutinin core protein AGA1 -
yeast (Saccharomyces
cerevisiae)


25 


927 


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


0.006 


3258584


(U41263) The 3' UTR of this
gene overlaps the 3' UTR of
T19D12.6 (confirmed by EST
hits) [Caenorhabditis elegans]


2.0 


928 


U33949 


Human Down 
Syndrome region of
chromosome 21,
genomic sequence, 
clone AI2HMA6. 


0.006 


3850997


(AF067150) beta-hydroxyacyl- 
ACP dehydratase precursor 


1.9 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE


ACCESSION 


DESCRIPTION 


P VALUE 
















1175 


AF027173 


Arabidopsis thaliana
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1176


Y09232


H.sapiens fertilin
alpha pseudogene 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1177 


AJ005813


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme 


2e-04 


<NONE>


<NONE> 


<NONE> 


1178 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1179 


AF072847 


Homo sapiens 
putative swelling- 
activated chloride 
channel (CLNS1A) 
gene, intron 6


2e-04 


<NONE> 


<NONE> 


<NONE> 


1180 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds 


2e-04 


<NONE>


<NONE> 


<NONE> 


1181


U76524 


ribosome inactivating 
protein precursor 
mRNA. complete cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1182


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


2e-04 


1213557 


(U50199) coded for by C. 
elegans cDNA yk89e9.5; coded 
for by C. elegans cDNA cm7g5;
coded for by C. elegans cDNA
cm14b9; coded for by C.
elegans cDNA yk52g5.5; coded
for by C. elegans cDNA
yk76e5.5; coded for by C.
elegans cDNA yk131f11.5; c...


8.4 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












EPITHELIAL DISCOIDIN




1183 


AF090115


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA, 
complete cds 


2e-04 


729008 


DOMAIN RECEPTOR 1
PRECURSOR (TYROSINE-
PROTEIN KINASE CAK)
(CELL ADHESION KINASE) 
(TYROSINE KINASE DDR) 
(DISCOIDIN RECEPTOR 
TYROSINE KINASE) (TRK E) 
(PROTEIN-TYROSINE 
KINASE RTK 6) sapiens] 


8J 


1184


AF012899


Sambucus nigra
ribosome inactivating
protein precursor 
mRNA, complete cds 


2e-04 


2507582 


HYPOTHETICAL f38. l r4l) 

PROTEIN IN MOLR-BGLX 
INTERGENIC REGION 
>gi|1788436 (AE000300)
putative regulator [Escherichia
coli]


7.8 


1185 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA. complete cds 


2e-04 


1085500 


collagen alpha I(IX) chain - 
mouse musculus] 
>gi|744962|prf||2015346A
collagen:SUBUNIT=alpha1:ISO
TYPE=IX [Mus musculus]


7.8 




AF027173 


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete
cds


2e-04 


2623967 



fY \ lO^T? 1 ) PiTN Reducrase 
[Agrobacterium radiobacter]


7.4 


1187 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme


2e-04


2497316 


ADVANCED
GLYCOSYLATION END
PRODUCT-SPECIFIC
RECEPTOR PRECURSOR
(RECEPTOR FOR
ADVANCED
GLYCOSYLATION END
PRODUCTS) products receptor
precursor - bovine >gi|163651
(M91212) receptor for advanced
glycosylation end products [Bos
taurus]


5.3 


1188


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


2e-04 


1001710 


(D64004) hypothetical protein


3.5 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Arabidopsis thaliana






(U41263) The 3' UTR of this




1189 


AJ005813


mRNA for 
neoxanthin cleavage 


2e-04


3258584 


gene overlaps the 3' UTR of 
T19D12.6 (confirmed by EST
hits) [Caenorhabditis elegans]


it- i 


1190 


AF027173 


Arabidopsis thaliana 
cellulose synthase
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


2e-04 


2736338 


(AF038623) contains similarity 
to RNA recognition motifs 


0.89


1191 


U72396 


Lycopersicon
esculentum class II 
small heat shock 
protein Le-HSP17.6 
mRNA. complete cds 


2e-04 


2196567 


(D88588) lipoprotein 
[Escherichia coli] 


0.69 


1192 


AF090115


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2 
(HSP17.4) mRNA. 
complete cds 


2e-04 


3319874 


(AJ006096) F-spondin 
[Branchiostoma floridae] 


5e-04 


1193 




Chlamydomonas
reinhardtii dynein
heavy chain alpha
(ODA11) gene, exons
j.- 1 jnu puruai Luj, 


2e-04


JO/U/ /J 


(Z81077) predicted using
Genefinder; Similarity to Yeast 
jroiein o— \ i t\,vjjo too i ) 




1194 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-04


<NONE> 


<NONE> 


<NONE> 


1195 


AF064029 


Helianthus tuberosus
lectin 1 mRNA,
complete cds


1e-04


<NONE> 


<NONE> 


<NONE> 


1196 


L34219 


Homo sapiens
retinaldehyde-binding
protein (CRALBP)
gene, complete cds.


1e-04


<NONE> 


<NONE> 


<NONE> 


1197 


X51890


Rhesus monkey
interleukin-3 gene


1e-04


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Plasmodium 










1198 


AE001421 


falciparum 
chromosome 2. 
section 58 of 73 of 
the complete 
sequence 


1e-04


<NONE> 


<NONE> 


<NONE> 


1199 


AF090115 


Lycopersicon 
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds


1e-04


<NONE> 


<NONE> 


<NONE> 


1200 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds


1e-04


2576287 


(Y15086) HepC protein
[Cylindrotheca fusiformis]


4.7 


1201 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme


1e-04


3395673 


(AB016623) RWC-3 [Oryza 
sativa]


0.14 


1202 


AF038035


Homo sapiens
BRCA1-associated
RING domain protein
(BARD1) gene,
exons 2 and 3 


9e-05 


<NONE> 


<NONE> 


<NONE> 


1203 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme


9e-05 


<NONE> 


<NONE> 


<NONE> 


1204 


AB012106


Brassica rapa mRNA 
for SRK45, complete 
cds 


9e-05 


<NONE> 


<NONE> 


<NONE> 


1205 


U95098


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 


9e-05 


<NONE> 



<NONE> 


<NONE> 


1206 



AF034099 


Laccaria bicolor 
zlyoxal ma late 
synthase protein 
mRNA. complete cds 


9e-05 


1351553


HYPOTHETICAL 
LIPOPROTEIN MG348 
PRECURSOR 
>gi|1361668|pir||E64238
hypothetical protein MG348 -
Mycoplasma genitalium (SGC3)
>gi|334493 1 


8.8 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 



P VALUE 



1207 D50006



1208 U50423



1209 AJ005813



Human DNA for 
alpha-platelet-derived
growth factor 
receptor, exon 6-10 



Human Down 
Syndrome region of 
chromosome 21, 
clone A41B3-IB7. 



Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme 



9e-05 



3063639 



9e-05 



124273 



9e-05 



4007782 



(AF056494) NADH 
dehydrogenase subunit 5 
[Panorpa japonica] 



INHIBIN ALPHA CHAIN
PRECURSOR bovine 
>gi|163195 (M13273) inhibin A 
subunit [Bos taurus] 



5.1 



(X72850) 2,4-
dihydroxybenzoate 
monooxygenase [Sphingomonas 
sp] 



3.0 



2.3 



1210 AC005276



Homo sapiens clone 
fragment 

UWGC:gap3 from 
7q31.3, complete 
sequence [Homo 
sapiens] 



9e-05 



1492075 



1211 AF100694 



Mus musculus
Pontin52 mRNA,
complete cds 



(U60315) MC132L [Molluscum 
contagiosum virus subtype 1]



1.0 



9e-05 



2887423 



1212 X77772 



C.fuscus gamma-M2-
crystallin mRNA.



9e-05 



2072425 



1213 AB012106



Brassica rapa mRNA
for SRK45, complete
cds 



8e-05 



<NONE> 



(AB007884) KIAA0424 [Homo
sapiens] 



2e-10 



(U83115) non-lens beta gamma
crystallin like protein [Homo
sapiens] 



7e-25 



<NONE> 



<NONE> 



1214 L06178 



Apis mellifera
ligustica complete
mitochondrial
genome



3e-05 



<NONE> 



<NONE> 



<NONE> 



1215 AB012106 



Brassica rapa mRNA
for SRK45, complete
cds



8e-05 



<NONE> 



<NONE> 



<NONE> 



1216 U76524 



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 



8e-05 



<NONE> 



<NONE> 



<NONE> 



1217 L06178 



Apis mellifera
ligustica complete
mitochondrial
genome



8e-05 



<NONE> 



<NONE> 



1218 AB012106



Brassica rapa mRNA
for SRK45, complete
cds



8e-05



<NONE> 



<NONE>



<NONE> 



<NONE>
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE






Mus musculus 










1219 


AF100694


Pontin52 mRNA, 
complete cds 


8e-05 


<NONE> 


<NONE> 


<NONE> 


1220 


AB012106 


Brassica rapa mRNA
for SRK45, complete
cds 


8e-05 


<NONE> 


<NONE> 


<NONE> 


1221 


AB012106 


Brassica rapa mRNA 
for SRK45, complete 
cds 


8e-05 


1722841 


WNT-11 PROTEIN
PRECURSOR (XWNT-11)
clawed frog >gi|439108 
(L23542) maternal protein 


9.9 


1222 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA. complete 
cds 


8e-05 


1205991 


(U35637) nebulin [Homo 
sapiens] 


9.6 


1223 


AF024605 


Homo sapiens serine 
protease-like protease
Sequence 2 from 
patent US 5736377 


8e-05 


3242783 


(AF055354) respiratory burst 
oxidase protein B 


8.6 


1224 


Y13148 


Rattus norvegicus
mRNA for PAG608
gene


8e-05 


2314243 


(AE000616) alpha-ketoglutarate 
permease (kgtP)


8.1 


1225 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme


8e-05 


1170586 


RAS GTPASE-ACTIVATING-
LIKE PROTEIN IQGAP1
(P195) (KIAA0051)
>gi|627594|pir||A54854 Ras
GTPase activating-related
protein - human sapiens]
>gi|536844 (L33075) ras
GTPase-activating-like protein
[Homo sapiens]


7.8 


1226 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA. complete 
cds 


8e-05 


464239 


NADH-UBIQUINONE 
OXIDOREDUCTASE CHAIN 
4>gi|1085t85|pir||S5:96S 
NADH dehydrogenase chain 4 - 
honeybee mitochondrion 
(SGC4) >gi|552446 (L06178)
NADH dehydrogenase subunit 4
[Apis mellifera ligustica]


3.5 


1227 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


8e-05 


544353 


F-SPONDIN PRECURSOR


3.5 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 



P VALUE 



1228 U76524



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 



8e-05 



483243 



apolipoprotein B-100 - chicken
(fragment) 



1229 AF093268



Rattus norvegicus
homer-1c mRNA,
complete cds 



8e-05 



91207 



proline-rich protein - mouse 
(fragment) musculus] 



3.4 



2.2 



1230 AF027173



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath 
A) mRNA, complete 
cds 



ZONADHESIN PRECURSOR 



8e-05 



2499181



>gi|1066466



1231 AF027173



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath 
A) mRNA, complete 
cds 



8e-05 



1232 AB012106 



Brassica rapa mRNA
for SRK45, complete
cds 



8e-05 



2499181 



ZONADHESIN PRECURSOR 
>gi|1066466



2833647 



(AF027972) flagelliform silk
protein [Nephila clavipes] 



2.2 



1.9 



1.6 



1233 AF093268



Rattus norvegicus 
homer-1c mRNA,
complete cds 



8e-05 



1163063



(Z49821) MYO2
[Saccharomyces cerevisiae] 



0.90 



1234 AF027174 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath 
B) mRNA, complete
cds 



8e-05 



1653488 



(D90914) hypothetical protein 



0.30 



1235 M26510 



Chicken nonmuscle 
myosin heavy chain 
(MHC) gene, 
complete cds. 



8e-05 



112159 



plectin - rat 



0.003 



1236 U56402 



Human chromatin
structural protein
homolog



8e-05 



2088823 



(AF003384) weak similarity to 
the peptidase family A2 



le-1 



1237 AF100694 



Mus musculus
Pontin52 mRNA. 
complete cds 



8e-05 



437181 



1238 AF100694



Mus musculus
Pontin52 mRNA,
complete cds



(U02289) GTPase-activating 
protein [Caenorhabditis elegans] 



8e-05 



465983 



HYPOTHETICAL 80.8 KD 
PROTEIN ZC21.4 IN
CHROMOSOME III 



2e-17 



8e-27 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 






P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1239 


AF090115 


Lycopersicon 
esculentum cytosolic 
class II small heat
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


7e-05 


<NONE> 


<NONE> 


<NONE> 


1240 


U83656 


Rattus norvegicus NF- 
KB gene, promotor 
region


7e-05 


3880858 


(AL031633) predicted using 
Genefinder; cDNA EST 
yk304fl2.5 comes from this 
gene [Caenorhabditis elegans]


9.3 


1241 


AF074387 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


7e-05 


3080538 


(AL022600) hypothetical 
protein


9.2 


1242 


X89398 


H.sapiens ung gene 
for uracil DNA- 
glycosylase


7e-05




HYPOTHL 1 1LAL 1 j. I iOJ 
PROTEIN IN MDH1-VMA5 
INTERGENIC REGION 

-xrilS^O 1 JPIni HIS 37908 
hypothetical protein YKL083w - 
yeast (Saccharomyces 
cerevisiae) >gi|486120 

/71ttnCT\ HOC YVT fW^xi; 


i s 

i -Q 


1243 


M33753 


Bovine follicle 
stimulating hormone- 
beta subunit gene,
complete cds. 


7e-05 


2398621 


(AJ000342) DMBT1 protein, 
5.8 kb transcript [Homo sapiens] 


1.8 


1244 


M80829


Rat troponin T 
cardiac isoform gene, 
complete cds 


5e-05 


854065 


(X83413) U88 [Human
herpesvirus 6]


2e-08 


1245


AF074387 


Sambucus nigra 
hevein-like protein 
mRNA. complete cds 


4e-05 


120240 


FLAGELLIN B2 PRECURSOR
Methanococcus voltae
>gi|150063 (M72148) flagellin


5.2 


1246 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor 
mRNA. complete cds 


3e-05 


<NONE> 


<NONE> 


<NONE> 


1247 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA. complete cds 


3e-05 


<NONE>


<NONE> 


<NONE> 


1248


AF074386


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


3e-05 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Rattus norvegicus 










1249 


AF093268 


homer-1c mRNA,
complete cds 


3e-05 


<NONE> 


<NONE> 


<NONE> 


1250 


AB012106 


Brassica rapa mRNA 
for SRK45, complete 
cds 


3e-05 


27732^6 


(AF039716) Similar to protein 

kinase [Caenorhabditis elegans]


ft 1 


1251 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-05 


2072961 


(U93568) putative p150 [Homo
sapiens] 


5.6 


1252 


U72396 


Lycopersicon 
esculentum class II 
small heat shock 
protein Le-HSP17.6
mRNA. complete cds 


3e-05 


121855 


EXOGLUCANASE II
PRECURSOR cellulose 1,4-beta-
cellobiosidase (EC 3.2.1.91) II
precursor - fungus (Trichoderma

(EC 3.2.1.91) II - fungus
cellobiohydrolase II
[Trichoderma reesei]


4.6 


1253 


U76524 


Sambucus nigra 
ribosome inactivating
protein precursor 
mRNA. complete cds 


3e-05 


3880516 


(AL021572) similar to CTP
SYNTHASE (EC 6.3.4.2) (UTP-
-AMMONIA LIGASE) (CTP 
SYNTHETASE) 


3.3 


1254 


M88299 


Mouse brain- 1 POU- 
domain protein, 
complete cds. 


3e-05 


1947048 


(U66102) intimin [Escherichia 
coli] 


3.0 


1255 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 44 
mRNA, partial cds 


3e-05 


3122872 


CELL-CYCLE nuclear 

AUTOANTIGEN SG2NA 
(S/G2 NUCLEAR ANTIGEN) 
>gi|l082650|pir||JC2522 nuclear 
autoantigen - human >gi|805095
(U17989) GS2NA


2.8 


1256 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor
mRNA, complete cds 


3e-05 


1352145 


CYTOCHROME C OXIDASE

POLYPEPTIDE I chain I - 
Thermus aquaticus >gi| 155083 
(M84341) cytochrome c oxidase 
subunits precursor [Thermus 
thermophilus] 


2.6 


1257 


U72396 


Lycopersicon 
esculentum class II 
small heat shock 
protein Le-HSP17.6 
mRNA, complete cds 


3e-05 


2811015 


SEGMENTATION POLARITY 
PROTEIN ENGRAILED 
>gi|2076747 (U42429) 
engrailed [Anopheles gambiae] 
>gi|214S9lS(U42214) 
engrailed [Anopheles gambiae] 


2.0 


1258 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


3e-05 


1657752 


(U62325) FE65-like protein 
[Homo sapiens] 


1.7 


1259 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-05 


2072961 


(U93568) putative p150 [Homo 
sapiens] 


1.5 


1260 


U76523 


Sambucus nigra lectin 
precursor mRNA, 
complete cds 


3e-05 


1352145 


CYTOCHROME C OXIDASE 

POLYPEPTIDE I chain I - 
Thermus aquaticus >gi|155083 
(M84341) cytochrome c oxidase 
subunits precursor [Thermus 
thermophilus] 


1.1 


1261 


X91890 


H.sapiens regulatory 
region of HOXA7 
gene 


3e-05 


111013 


oxr (Dkm-nomoiog; sex- 
determining region protein - 
mouse 


1.0 


1262 


L36936 


Homo sapiens metase 
gene, partial cds. 


3e-05 


1944352 


(D84239) IgG Fc binding 
protein [Homo sapiens] 


0.99 


1263 


AB012105 


Brassica rapa mRNA 
for SLG45, complete 
cds 


3e-05 


4177S2 


SMP2 PROTEIN 
>gi|320853|pir||S30911 SMP2 
protein - yeast (Saccharomyces 
cerevisiae) gene 
[Saccharomyces cerevisiae] 


0.S9 


1264 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-05 


1708501 


INTEGRIN ALPHA CHAIN- 
LIKE PROTEIN alpha Int1p 
[Candida albicans] 


0.39 


1265 


AF090115 


Lycopersicon 
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


3e-05 


15S7031 


cis-Golgi matrix protein GM130 
[Rattus norvegicus] 


0.20 


1266 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between markers 
DXS366 and DXS87 
on chromosome X * 


3e-05 


2072964 


(U93569) putative p150 [Homo 
sapiens] 


0.049 


glycosylated and myristilated 




1267 


Z96668 


H.sapiens telomeric 
DNA sequence, clone 
7PTEL001, read 
7PTELOO001.seq 


3e-05 


542429 


smaller surface antigen - 
Plasmodium falciparum 
>gi|836640 (X76298) 
glycosylated and myristilated 
smaller surface antigen gallusj 
>gi|1092178|prf||2023165B 
surface antigen 


0.029 


1268 


AB012105 


Brassica rapa mRNA 
for SLG45, complete 
cds 


3e-05 


3879121 


(Z70310) predicted using 
Genefinder; Similarity to Mouse 
ankyrin (PIR Acc. No. S37771); 
cDNA EST EMBL:T01923 
comes from this gene; cDNA 
EST EMBL:D32335 comes 
from this gene; cDNA EST 
EMBL:D32723 comes from this 
gene* LuiNrt lj... vjcncunuci, 
Similarity to Mouse ankyrin 
(PIR Acc. No. S37771); cDNA 
EST EMBL:T01923 comes 
from this gene; cDNA EST 
EMBL:D32335 comes from this 
gene; cDNA EST 
EMBL:D32723 comes from this 
gene; cDNA ES ... 


2e-13 




AF074385 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


3e-05 


2497677 


ZYXIN (ZYXIN 2) sapiens] 
>gi|lD43954|gnl|PlD|e22j41 / 
(X95735) zyxin 


2e-23 


1270 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


1e-05 


<NONE> 


<NONE> 


<NONE> 


1271 


X16318 


Canine mRNA for 
signal recognition 
particle 54k protein 


1e-05 


3122612 


PITUITARY HOMEOBOX 3 
(HOMEOBOX PROTEIN 
PITX3) >gi|2645427 
(AF005772) homeobox protein 
Pitx3 [Mus musculus] 


4.4 


1272 


AB012105 


Brassica rapa mRNA 
for SLG45, complete 
cds 


1e-05 


1652458 


(D90905) DNA mismatch repair 
protein MutL [Synechocystis 
sp.] 


0.62 


1273 


U57843 


Human 

phosphatidylinositol 
3-kinase delta 
catalytic subunit 
mRNA, complete cds 


1e-05 


475909 


(X67098) ORF1A [Homo 
sapiens] 


0.22 


1274 


Z96569 


H.sapiens telomeric 
DNA sequence, clone 
2QTEL054, read 
2QTELOO054.seq 


1e-05 


2137043 


unknown protein - rabbit 
(fragment) cuniculus] 


0.005 


1275 


AE000810 


Methanobacterium 
thermoautotrophicum 
from bases 172512 to 
182957 (section 16 of 
148) of the complete 
genome 


1e-05 


3877579 


kinensin-like protein KIF4 
(SW:P33174); cDNA EST 
EMBL:D27320 comes from this 
gene; cDNA EST 
EMBL:D27322 comes from this 
gene; cDNA EST 
EMBL:D27321 comes from this 
gene; cDNA EST 
EMBL:D35764 comes... Mouse 
kinensin-like protein KIF4 
(SW:P33174); cDNA EST 
EMBL:D27320 comes from this 
gene; cDNA EST 
EMBL:D27322 comes from this 
gene; cDNA EST 
EMBL:D27321 comes from this 
gene; cDNA EST 
EMBL:D35764 comes... 


6e-27 


1276 


AB012113 


Homo sapiens gene 
for CC chemokine 
PARC precursor, 
complete cds 


9e-06 


<NONE> 


<NONE> 




1277 


AC005830 


Homo sapiens Xpll- 
1d4-1dd BAC GSHB- 
52411 (Genome 
Systems Human BAC 
Library), complete 
sequence [Homo 
sapiens] 


9e-06 


<NONE> 


<NONE> 


<NONE> 


1278 


DS6245 


Human MHC (HLA) 
DRB intron 1 DNA, 
partial sequence 


9e-06 


1051253 


(U37531) mucin apoprotein 
[Mus musculus] 


1.3 


1279 


D79998 


Human mRNA for 
KIAA0176 gene, 
partial cds 


9e-06 


2833253 


HYPOTHETICAL PROTEIN 
KIAA0176 sapiens] 


4e-06 





(Z-oythOj similarity to i east 






U10246 


Toxoplasma gondii 
RH uracil 
phosphoribosyl 
transferase gene, 
complete cds. 


9e-06 


3876090 


uridine kinase 

(SW:URK1_YEAST); cDNA 
EST EMBL:Z14695 comes 
from this gene; cDNA EST 
CEMSE17F comes from this 
gene; cDNA EST 
EMBL:D67355 comes from this 
gene; cDNA EST yk209h1.5 
comes from this ge... 


7e-33 




U10246 


Toxoplasma gondii 
RH uracil 
phosphoribosyl 
transferase gene, 
complete cds. 


9e-06 


3876090 


(/.oyojj) Similarity to Yeast 
uridine kinase 

(SW:URK1_YEAST); cDNA 
EST EMBL:Z14695 comes 
from this gene; cDNA EST 
CEMSE17F comes from this 
gene; cDNA EST 
EMBL:D67355 comes from this 
gene; cDNA EST yk209h1.5 
comes from this ge... 


7e-34 


1282 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


8e-06 


<NONE> 


<NONE> 


<NONE> 


1283 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


8e-06 


<NONE> 


<NONE> 


<NONE> 


1284 


U66340 


Human Rh blood 
group C antigen 
(RHCE) gene, exon 
2, partial cds 


8e-06 


1707155 


(U80837) F07E5.6 gene product 
[Caenorhabditis elegans] 


9.6 


1285 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


7e-06 


<NONE> 


<NONE> 


<NONE> 


1286 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 15, 
16 and 17. 


4e-06 


<NONE> 


<NONE> 


<NONE> 


1287 


L42103 


Homo sapiens 
(subclone 5_d3 from 
PI H25) DNA 
sequence. 


3e-06 


<NONE> 


<NONE> 


<NONE> 


Mus musculus 










1288 


AF012244 


cerberus-like (Cer-1) 
gene, complete cds 


3e-06 


<NONE> 


<NONE> 


<NONE> 


1289 


Z69366 


Human DNA 
sequence from 
cosmid L96F8, 
Huntington's Disease 
Region, chromosome 
4p16.3 contains EST. 

3e-06 


<NONE> 


<NONE> 


<NONE> 


1290 


Z69366 


Human DNA 
sequence from 
cosmid L96F8, 
Huntington's Disease 
Region, chromosome 
4p16.3 contains EST. 


3e-06 


<NONE> 


<NONE> 


<NONE> 


1291 


XS5232 


H.sapiens 
chromosome 3 
sequences 


3e-06 


<NONE> 


<NONE> 


<NONE> 


1292 


M32674 


Human platelet 
glycoprotein IIIa, 
exons 7, 8 and 9. 


3e-06 


<NONE> 


<NONE> 


<NONE> 


1293 


D16S79 


Human HepG2 3' 
region cDNA, clone 
hmd2a01 


3e-06 


998296 


(U33484) ependymin 
[Hemiodus sp.] 


5.6 


1294 


UIS614 


Lagothrix lagotricha 
interphotoreceptor 
retinoid-binding 
protein (IRBP) gene, 
intron 1, complete 
sequence 


3e-06 


1613846 


(U71440) polyprotein [Rice 
tungro spherical virus] 


5.0 


1295 


AF090115 


Lycopersicon 
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


3e-06 


1477646 


(U53204) plectin [Homo 
sapiens] >gi|1477651 (U63610) 
plectin [Homo sapiens] 


4.0 


1296 


AF016898 


Homo sapiens B-ATF 
gene, complete cds 


3e-06 


1085177 


reverse transcriptase - fruit fly 
reverse transcriptase 
[Drosophila yakuba] 


3.0 


1297 


AB018490 


Homo sapiens DNA, 
trinucleotide repeats 
region 


3e-06 


3876572 


(Z81522) predicted using 
Genefinder; similar to RNA 
recognition motif, (aka RRM, 
RBD, or RNP domain) 
[Caenorhabditis elegans] 


3.0 



1298 



AF027174 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 



3e-06 



4240137 



(AB020631) KIAA0824 protein 
[Homo sapiens] 



2.7 



1299 



M37929 



Homo sapiens 
adenosine 
monophosphate 
deaminase 1 
(AMPD1) gene, 
exons 11-12. 



3e-06 



1653775 



(D90916) thiol:disulfide 
interchange protein DsbD 
[Synechocystis sp.] 



1.7 



1300 



M37929 



Homo sapiens 
adenosine 
monophosphate 
deaminase 1 
(AMPD1) gene, 
exons 11-12. 



3e-06 



1653775 



(D90916) thiol:disulfide 
interchange protein DsbD 
[Synechocystis sp.] 



1.7 



1301 



U60496 



Glycine max actin 
(Soy86) gene, partial 
cds 



3e-06 



1730738 



ACTIN-LIKE PROTEIN ARP5 
YnI2430p [Saccharomyces 
cerevisiae] 



2e-05 



1302 



X14363 



Yersinia 
pseudotuberculosis 
rplC, rplD, rplW, 
rplB and rpsS genes 
for ribosomal proteins 
L3, L4, L23, L2 and 
S19 



3e-06 



585879 



50S RIBOSOMAL PROTEIN 
L2 maritima >gi|437926 
(Z21677) ribosomal protein L2 



2e-12 



1303 



Z34969 



H.sapiens DNA for 
microsatellite 
polymorphism 



2e-06 



<NONE> 



<NONE> 



<NONE> 



1304 



X64707 



H.sapiens BBC1 
mRNA 



1e-06 



<NONE> 



<NONE> 



<NONE> 



1305 



AC005830 



Homo sapiens Xp22- 
154-155 BAC GSHB 
5241 1 (Genome 
Systems Human BAC 
Library), complete 
sequence [Homo 
sapiens] 



1e-06 



<NONE> 



<NONE> 



<NONE> 



1306 



J04058 



Human electron 
transfer flavoprotein 
alpha-subunit mRNA, 
complete cds 



1e-06 



<NONE> 



<NONE> 



<NONE> 


1307 


L25647 


Homo sapiens 
fibroblast growth 
factor receptor gene 
(located in the central 
MHC) signal peptide 
and consecutive exon 


le-06 


1586734 


mxcv^ gene [Methylobacterium 
organophilum] 


5.4 


1308 


L26261 


Human MHC class III 
HLA-RP1 gene. 


1e-06 


1684985 


(U20633) NADH 
dehydrogenase subunit 
[Neuwiedia veratrifolia] 


1.8 


1309 


AF002283 


Mus musculus alpha- 
actinin-2 associated 
LIM protein mRNA, 
alternatively spliced 
product, complete cds 


1e-06 


2996196 


(AF053367) carboxyl terminal 
LIM domain protein [Mus 
musculus] 


4e-17 


1310 


M10935 


Human haptoglobin 
gene (alpha-2 allele), 
complete cds and 
haptoglobin-related 
gene, exon 1 and 
three Alu repeats. 


6e-07 


<NONE> 


<NONE> 


<NONE> 


1311 


AC002251 


Homo sapiens 
(subclone 1_g6 from 
BAC H76) DNA 
sequence 


4e-07 


2144491 


coagulation factor Xa (EC 
3.4.21.6) precursor norvegicus] 


4.2 


1312 


AF047717 


Streptomyces 
chrysomallus 
actinomycin 
synthetase II (acmB) 
gene, complete cds 


4e-07 


699196 


(U15181) 4-coumarate-coA 
ligase [Mycobacterium leprae] 


1e-06 


1313 


U14417 


Human Ral guanine 
nucleotide 
dissociation 
stimulator mRNA, 
partial cds. 


4e-07 


544402 


GUANINE NUCLEOTIDE 
DISSOCIATION 
STIMULATOR RALGDS 
FORM A (RALGEF) 
>gi|321257|pir||S2S4l5 guanine 
nucleotide dissociation 
stimulator ralGDS - mouse 
>gi|193573 (L07924) guanine 
nucleotide dissociation 
stimulator [Mus musculus] 


Se-OS 


1314 


Z79027 


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment, 
SC6pA20G3 


3e-07 


<NONE> 


<NONE> 


<NONE> 



1315 



U67167 



Homo sapiens 
intestinal mucin 
(MUC2) gene, 
promoter region and 
partial cds 



3e-07 



<NONE> 



<NONE> 



<NONE> 



1316 



AF086256 



Homo sapiens full 
length insert cDNA 
clone ZD41C11 



3e-07 



<NONE> 



<NONE> 



<NONE> 



1317 



U67228 



Human clone HS4.61 
Alu-Ya5 sequence 



3e-07 



1938437 



(U97003) contains similarity to 
C4-type zinc fingers and a 
ligand-binding domain of 
nuclear hormone receptors 



2.3 



1318 



U94346 



Human calpain-like 
protease (htra-3) 
mRNA, complete cds 



3e-07 



2911358 



(AF047659) No definition line 
found [Caenorhabditis elegans] 



0.39 



1319 



Y15724 



Homo sapiens 
SERCA3 gene, exons 
1-7 (and joined CDS) 



1e-07 



<NONE> 



<NONE> 



<NONE> 



1320 



X13596 



Bean DNA for 
glycine-rich cell wall 
protein GRP 1.3 



1e-07 



<NONE> 



<NONE> 



<NONE> 



1321 



MS3094 



Homo sapiens 
cytosolic selenium- 
dependent glutathione 
peroxidase gene, 
complete cds, and 
rhoh12 gene, 3' end. 



1e-07 



1326385 



(U58751) C07G1.7 gene 
product [Caenorhabditis 
elegans] 



8.0 



1322 



Z55905 



H.sapiens CpG DNA, 
clone 71f4, forward 
read cpg71f4.ft1a 



1e-07 



1076802 



extensin-like protein - maize 
>gi|600113 mays] 



0.61 



1323 



X03541 



Human mRNA of trk 
oncogene > :: 
gb|I96186|I96186 
Sequence 23 from 
patent US 5734039 



1e-07 



325465 



(M74509) [Human endogenous 
retrovirus type C oncovirus 
sequence.], gene product [Homo 
sapiens] 



3e-04 



1324 



AF027766 



Canis familiaris Y- 
linked zinc finger 
protein 



1e-07 



22064; 



X> 10628) zinc finger protein 
[Mus musculus] 



7e-0S 



1325 



D13613 



Bovine mRNA for 
rabphilin-3A, 
complete cds > :: 
dbj|E07809|E07809 
cDNA encoding 
rabphilin-3A 



1e-07 



2822161 



(AC004082) rab3 effector-like; 
35% Similarity to AF007836 
(PID:g2317778) [Homo 
sapiens] 



6e-11 


1326 


X57110 


Human mRNA for c- 
cbl proto-oncogene 


1e-07 


323270 


(J04169) gag-onc fusion protein 
[Cas NS1 retrovirus] 


3e-14 


1327 


X57110 


Human mRNA for c- 
cbl proto-oncogene 


1e-07 


115855 


human >gi|29731 (X57110) c- 
cbl protein [Homo sapiens] 


4e-19 


1328 


AC001178 


Homo sapiens 
(subclone 2_g12 from 
BAC H94) DNA 
sequence 


4e-08 


<NONE> 


<NONE> 


<NONE> 


1329 


UU866 


Human interleukin-8 
receptor type B 
(IL8RB) gene, 
promoter and exons 1- 
6 


4e-08 


<NONE> 


<NONE> 


<NONE> 


1330 


AC001225 


Homo sapiens 
(subclone 2_e6 from 
BAC H94) DNA 
sequence 


4e-08 


478184 


histone H1 II-1 (clone L95) - 
midge 


O.J 


1331 


M73837 


Human modulator 
recognition factor 2 
(MRF-2) mRNA, 
complete cds. 


4e-08 


141448 


HYPOTHETICAL 32.6 KD 

PROTEIN IN TRANSPOSON 

XXI ->.r»ItQmSSInirll rOfXl^S 
1 IN-O JO ->gl joU / JO|pii||J \£Kr+~a 

hypothetical 32.6 K protein - 
Streptomyces fradiae transposon 
Tn4556 


4.7 


1332 


AC006164 


Homo sapiens clone 
UWGC:y28gap from 
6p21, complete 
sequence [Homo 
sapiens] 


4e-08 


2580578 


(AF000996) ubiquitous TPR 
motif, Y isoform [Homo 
sapiens] 


1.2 


1333 


X01060 


Human mRNA for 
transferrin receptor 


4e-08 


135514 


T-CELL RECEPTOR BETA 
CHAIN PRECURSOR 
precursor (ANA 1 1) - rabbit 


0.61 


1334 


Y10697 


H. sapiens INE2 
mRNA 


4e-08 


124909 


INSULIN RECEPTOR- 
RELATED PROTEIN 
PRECURSOR (IRR) (IR- 
RELATED RECEPTOR) 
>gi|186555 sapiens] 


0.14 


1335 


U60416 


Rattus norvegicus 
myr 6 myosin heavy 
chain mRNA, 
complete cds 


4e-08 


102189 


myosin I. high molecular weight 
- Acanthamoeba sp 


3e-0S 


1336 


U23804 


Drosophila 
melanogaster putative 
GTP-binding 
regulatory protein 
beta chain (GPB) 
mRNA, partial cds. 


4e-08 


2494916 


TRP-ASP REPEATS 
CONTAINING PROTEIN 
T10F2.4 IN CHROMOSOME 
III protein; similar to G-Beta 
repeat region (Trp-Asp 
domains) of guanine nucleotide 
binding protein 


1e-28 


1337 


AE000213 


Escherichia coli K-12 

MG1655 section 103 

of 400 of the 
complete genome 


4e-08 


3294172 


(AL022325)rF27C3.Ll 
(protein similar to C. elegans 
protein B0035.16) (isoform I) 
[Homo sapiens] 


2e-67 


1338 


D89821 


Mus musculus mRNA 
for RhoM, complete 
cds 


2e-08 


3024539 


RHO-RELATED GTP- 
BINDING PROTEIN RHOD 
(RHO-RELATED PROTEIN 
HP1) (RHOHP1) sapiens] 


1e-04 


1339 


U74382 


Human telomeric 
repeat DNA-binding 
protein (PIN2) 
mRNA, complete cds 


1e-08 


<NONE> 


<NONE> 


<NONE> 


1340 




Homo sapiens 
(subclone H8 5_al0 
from PI 35 H5 CS) 
DNA sequence. 


1e-08 


<NONE> 


<NONE> 


<NONE> 


1341 


L21936 


Human succinate 
dehydrogenase 
flavoprotein subunit 


1e-08 


320 I 673 


(AF060386) adenine 
phosphoribosyltransferase 
[Leishmania tarentolae] 


4.0 


1342 


AB009777 


Homo sapiens gene 
for osteonidogen, 
promoter region 


le-08 


479388 


tritin - wheat 

>gi|391929|gnl|PID|d1003454 


2.2 


1343 


M58600 


Human heparin 
cofactor II (HCF2) 
gene, exons 1 through 
5. 


1e-08 


1730173 


GLUCOSE-6-PHOSPHATE 
ISOMERASE, CYTOSOLIC 2 
(GPI) (PHOSPHOGLUCOSE 
ISOMERASE) (PGI) isomerase 
Clarkia concinna] 


1.9 


1344 


M58600 


Human heparin 
cofactor II (HCF2) 
gene, exons 1 through 
5. 


1e-08 


1730173 


GLUCOSE-6-PHOSPHATE 
ISOMERASE. CYTOSOLIC 2 
(GPI) (PHOSPHOGLUCOSE 
ISOMERASE) (PGI) isomerase 
Clarkia concinna] 


1.7 


1345 


AC0009S0 


Homo sapiens 
(subclone 1_g2 from 
PI H31) DNA 
sequence 


1e-08 


439S77 


(L27428) reverse transcriptase 
[Homo sapiens] 


LI 


1346 


U48734 


Human non-muscle 
alpha-actinin mRNA, 
complete cds 


le-08 


168237 


(M76546) hydroxyproline-rich 
protein [Helianthus annuus] 


0.19 


1347 


M76724 


Human leukocyte 
adhesion receptor 
alpha subunit 


le-08 


1177607 


(X92485) pva1 [Plasmodium 
vivax] 


0.19 


1348 


AF067959 


Gallus gallus 
homeodomain protein 
HOXD-3 mRNA, 
complete cds 


le-08 


3165574 


(AF067942) No definition line 
found [Caenorhabditis elegans] 


0.15 


1349 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between markers 
DXS366 and DXS87 
on chromosome X * 


le-08 


2072964 


(U93569) putative p150 [Homo 
sapiens] 


0.001 


1350 


X57103 


Human h-lys gene for 
lysozyme (upstream 
region) 


7e-09 


<NONE> 


<NONE> 


<NONE> 


1351 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


7e-09 


231629 


BILE-SALT-ACTIVATED 
LIPASE PRECURSOR ESTER 
LIPASE) (STEROL 
ESTERASE) (CHOLESTEROL 
ESTERASE) salt-activated 
lipase [Homo sapiens] sapiens] 


0.22 


1352 


L34741 


Aplysia californica 
prohormone 
convertase (PC2) 
mRNA, complete cds. 


5e-09 


322054 


cytochrome-c oxidase (EC 
1.9.3.1) chain II precursor - 
Synechocystis sp. (PCC 6803) 
>gi|581739 sp.] 


5.0 


1353 


AF052959 


Homo sapiens type 
XV collagen 
(COL15A1) gene, 
exon 6 


4e-09 


131269 


PHOTOSYSTEM II P680 
CHLOROPHYLL A 
APOPROTEIN (CP-47 
PROTEIN) 

>gi|7270S|pir||QJLV6A 
photosystem II chlorophyll a- 
binding protein psbB - liverwort 
(Marchantia polymorpha) 
chloroplast >gi|11700 


1.8 


1354 


L15470 


Streptomyces 
clavuligerus (NRRL 
3585) clavulanic acid 
biosynthesis protein 
(cla) gene, complete 
cds and clavaminate 
synthase 2 (cs2) gene, 
partial cds. 


4e-09 


58602S 


(AGMATINE 

UREOHYDROLASE) (AUH) 
(PROCLAVAMINIC ACID 
AMIDINO HYDROLASE) 
>gi|1361423|pir||S57669 
Proclavaminic acid amidino 
hydrolase - Streptomyces 
clavuligerus >gi|295171 
Proclavaminic acid amidino 
hydrolase [Streptomyces 
clavuligerus] 

>gi|1586122|prf||2203286B 
proclavaminic acid amidino 
hydrolase [Streptomyces 
clavuligerus] 


4e-13 


1355 


AB002302 


Human mRNA for 
KIAA0304 gene, 
complete cds 


2e-09 


131600 


GENERAL SECRETION 
PATHWAY PROTEIN L 
product [Klebsiella pneumoniae] 
>gi|149311 (M32613) pulL 


o 5 


1356 


L34219 


Homo sapiens 
retinaldehyde-binding 
protein (CRALBP) 
gene, complete cds. 


le-09 


<NONE> 


<NONE> 


<NONE> 


1357 


AB002302 


Human mRNA for 
KIAA0304 gene, 
complete cds 


1e-09 


2224549 


(AB002302) KIAA0304 [Homo 
sapiens] 


5.0 


1358 


DS573 1 


Homo sapiens 
HSPA1L gene for 
Heat shock protein 70 
testis variant, 5'UTR, 
partial sequence 


le-09 


1389766 


(U5865S) unknown [Homo 
sapiens] 


1.3 


1359 


AF0644S3 


Homo sapiens natural 
resistance-associated 
macrophage protein 2 
(NRAMP2) gene, 
exon 17, alternatively 
spliced non-LRE 
form, complete cds 


8e-10 


113671 


!!!! ALU CLASS F WARNING 
ENTRY !!!! 


0.72 


1360 


AF002283 


Mus musculus alpha- 
actinin-2 associated 
LIM protein mRNA, 
alternatively spliced 
product, complete cds 


6e-10 


2996196 


(AF053367) carboxyl terminal 
LIM domain protein [Mus 
musculus] 


4e-21 


1361 


M26220 


African green 
monkey origin of 
replication 


5e-10 


2143455 


gene DMR-N9 protein - mouse 
(fragment) 


8.8 


1362 


Z78006 


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment, 
SC6pA7F10 


4e-10 


2072977 


(U93574) putative p150 [Homo 
sapiens] 


0.005 


1363 


U82303 


Homo sapiens 
unknown protein 
mRNA, partial cds 


2e-10 


182571! 


(US8183) similar to the 
immunoglobulin superfamily, 
most similar to nerual cell 
adhesion proteins 
[Caenorhabditis elegans] 


0.031 


1364 


AF079764 


Drosophila 
melanogaster 
enhancer of 
polycomb 


2e-10 


3757890 


(AF079764) enhancer of 
polycomb [Drosophila 
melanogaster] 


le-10 


1365 


L24123 


Homo sapiens NRF1 
protein (NRF1) 
mRNA. 


2e-10 


3004573 


(AC004520) similar to NFE2- 
related transcription factors; 
similar to I4S694 
(PID:g2137676) [Homo 
sapiens] 


4e-53 


1366 


M91454 


Orangutan alpha- 
globin gene duplicate 
region. 


le-10 


464239 


NADH-UBIQUINONB 
OXIDOREDUCTASE CHAIN 
4>gi|t085lS5|pir||S52968 
NADH dehydrogenase chain 4 - 
honeybee mitochondrion 
(SCC4) >gi|552446 (L06178) 
NADH dehydrogenase subunit 4 
[Apis mellifera ligustica] 


6.0 


1367 


D87117 


House mouse: 
Musculus domesticus 
brain mRNA for 
SAP 102, complete 
cds 


6e-11 


473912 


(L31961) phosphoprotein [Mus 
cookii] 


2.2 


1368 


AC001002 


Homo sapiens 
(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-11 


<NONE> 


<NONE> 


<NONE> 


Homo sapiens 












AC001002 


(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-ll 


<NONE> 


<NONE> 


<NONE> 


1370 


AB007874 


Homo sapiens 
KIAA0414 mRNA, 
partial cds 


5e-ll 


<NONE> 


<NONE> 


<NONE> 


1371 


AC001002 


Homo sapiens 
(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-11 


<NONE> 


<NONE> 


<NONE> 




AC001002 


Homo sapiens 
(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-ll 


<NONE> 


<NONE> 


<NONE> 


1373 


AC001002 


Homo sapiens 
(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-11 


<NONE> 


<NONE> 


<NONE> 


1374 


AC001002 


Homo sapiens 
(subclone 2_h9 from 
PI H39) DNA 
sequence 


5e-ll 


<NONE> 


<NONE> 


<NONE> 


1375 


Z21S52 


H.sapiens mRNA for 
HERV-K long 
terminal repeat 


5e-ll 


419481 


gag polyprotein - human 
endogenous virus S71 


4.6 


1376 


AB007928 


Homo sapiens mRNA 
for KIAA0459 
protein, partial cds 


5e-11 


2947238 


(AF051782) diaphanous 1 
[Homo sapiens] 


2.3 


1377 


D87117 


House mouse: 
Musculus domesticus 
brain mRNA for 
SAP 102, complete 
cds 


5e-11 


473912 


(L31961) phosphoprotein [Mus 
cookii] 


i.S 


1378 


AJ131501 


Homo Sapiens DNA 
sequence between 
two AML1 gene 
promoters, 6423 BP 


5e-ll 


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


0.20 


1379 


M27S26 1 


Human endogenous 
retroviral protease 
mRNA, complete cds. 


5e-11 


8855S 


retroviral proteinase-like protein 
- human 


0.002 


HYPO i Hfc 1IL AL 5x1 KD 




1380 


U23804 


Drosophila 
melanogaster putative 
GTP-binding 
regulatory protein 
beta chain (GPB) 
mRNA, partial cds. 


5e-ll 


2494916 


TRP-ASP REPEATS 
CONTAINING PROTEIN 
T10F2.4 IN CHROMOSOME 
III protein; similar to G-Beta 
repeat region (Trp-Asp 
domains) of guanine nucleotide 

binding protein 


1e-30 


1381 


Z22784 


M.musculus troponin 
I aene. 


JC- l l 




(AF072889) transcription 
rcprcobur uruin iuliui — 


0.053 


1382 


AB007880 


Homo sapiens 
KIAA0420 mRNA, 
complete cds 


2e-11 


<NONE> 


<NONE> 


<NONE> 


1383 


AF020361 


Homo sapiens BAX 
gene, exon 6, partial 
sequence 


2e-ll 


<NONE> 


<NONE> 


<NONE> 


1384 


L35600 


Homo sapiens DNA 
sequence. 


2e-11 


1174952 


GLYCOPROTEIN D 
PRECURSOR gD [Bovine 
herpesvirus 1] 


0.25 


1385 


U21943 


Human organic anion 

transporting 

polypeptide 


2e-11 


li JQJ.J.J 


(U95011) brain-specific organic 
anion transporter 




1386 


U90878 


Homo sapiens 
carboxyl terminal 
LIM domain protein 


2e-11 


2996196 


(AF053367) carboxyl terminal 
LIM domain protein [Mus 
musculus] 


4e-23 


1387 


U31929 


Human orphan 
nuclear receptor 
(DAX1) gene, 
complete cds 


6e-12 


<NONE> 


<NONE> 


<NONE> 


1388 


M25828 


Human von 
Willebrand factor 
gene, exon 1,2, and 
3, and three Alu 
repetitive elements. 


oe-U 


<NONE> 


<NONE> 


<NONE> 


1389 


AB020648 


for KIAA0841 
protein, partial cds 


3e-12 


<NONE> 


<NONE> 


<NONE> 


1390 


Z15026 


H.sapiens genes for 
tumor necrosis factor 
(Tnfa) and 
lymphotoxine (Tnfb) 


2e-12 


<NONE> 


<NONE> 


<NONE> 


1391 


L23101 


Homo sapiens 
kallistatin (PI4) gene, 
exons 1-4, complete 
cds 


2e-12 


<NONE> 


<NONE> 


<NONE> 


1392 


Z47046 


Human cosmid 
OLL2C9 from Xq28 


2e-12 


<NONE> 


<NONE> 


<NONE> 


H. sapiens flow-sorted 










1393 


Z79007 


chromosome 6 
HindIII fragment, 
SC6nA'>0E'> 


2e-12 


I0632 7 


hypothetical protein (L1H 3' 
region) - human 




1394 


U34377 


Human tyrosine 
kinase TXK (txk) 
gene, exon 13- 


le-12 


151484 


(M55524) ORF 4; putative 
[Pseudomonas aeruginosa] 


4.3 


1395 


D70845 


Mus musculus apg-1 
gene for novel 
member of heat shock 
protein 110, promoter 
region 


le-12 


113658 


ALKALINE PROTEINASE 
PRECURSOR (ALP) precursor - 
fungus (Acremonium 
chrysogenum) 


3.5 


1396 


M63978 


Human vascular 

endothelial growth 

factor gene, exon 8. 


le-12 


3982737 


(AF069731) calmodulin- 
dependent protein kinase II beta 
M isoform [Rattus norvegicus] 


0.083 


1397 




Homo sapiens 
lysosomal alpha- 
mannosidase (manB) 

mRNA, complete cds 






<NONE> 


<NONE> 


1398 


Z68297 


Caenorhabditis 
elegans cosmid 
F11A10, complete 
sequence 
[Caenorhabditis 
elegans] 


7e-13 


2393734 


(AC002542) similar to C. 
elegans F11A10.5; 80% 
similarity to Z68297 
(PID:g1130619) [Homo 
sapiens] 


5e-34 


1399 


Z68297 


Caenorhabditis 
elegans cosmid 
F11A10, complete 
sequence 
[Caenorhabditis 
elegans] 


7e-13 


2393734 


(AC002542) similar to C. 
elegans F11A10.5; 80% 
similarity to Z68297 
(PID:g1130619) [Homo 
sapiens] 


3e-3S 


1400 


Z6S885 


Human DNA 
sequence from 
cosmid L21F12B, 
Huntington's Disease 
Region, chromosome 
4p16.3, contains 
EST. 


6e-13 


<NONE> 


<NONE> 


<NONE> 


1401 


X76104 


H.sapiens DAP- 
kinase mRNA 


6e-13 


291U54 


(AB007143) ZIP-kinase [Mus 
musculus] 


0.007 


1402 


Z7866S 


H.sapiens flow-sorted 
chromosome 6 TaqI 
fragment, 
SC6pA13G4 


5e-13 


106322 


hypothetical protein (L1H 3' 
region) - human 


2e-06 


1403 


L35600 


Homo sapiens DNA 
sequence. 


3e-13 


3184290 


(AC004136) hypothetical 
protein [Arabidopsis thaliana] 


1.7 


Cloning vector 










1404 


AF090452 


pKODT complete 
sequence 


2e-13 


3876730 


(Z49966) F35CU.4 
[Caenorhabditis elegans] 


7.8 


1405 


D28126 


Human gene for ATP 
synthase alpha 
subunit, complete cds 
(exon 1 to 12) 


2e-13 


419481 


gag polyprotein - human 
endogenous virus S71 


3.4 


1406 


AF005219 


Homo sapiens 
transcription factor 
HOXD13


2e-13


2822166 


(AC004080) transcription factor 
HOXA13 [Homo sapiens]


5e-09 


1407 


AB018301


Homo sapiens mRNA
for KIAA0758
protein, partial cds


2e-13


3882237 


(AB018301) KIAA0758 protein 
[Homo sapiens]


1e-23


1408 


D70845 


Mus musculus apg-1
gene for novel
member of heat shock
protein 110, promoter
region


1e-13


113658 


ALKALINE PROTEINASE 
PRECURSOR (ALP) precursor - 
fungus (Acremonium
chrysogenum)


3.1 


1409 


AG000691 


Homo sapiens 
genomic DNA, 21q
region, clone: 
T171BG33 


8e-14 


930045 


(X15332) alpha-1 (III) collagen
[Homo sapiens] 


3e-04 


1410 


D30785 


Mouse mRNA for 
neuropsin, complete 
cds 


8e-14


3559978 


(AJ005641) serine protease 
[Rattus rattus] 


2e-12 


1411 


U32710 


Haemophilus 
influenzae Rd section 
25 of 163 of the 
complete genome


8e-14 


4106673 


(AL035064) queuine trna- 
ribosyl transferase 
[Schizosaccharomyces pombe]


2e-38 


1412 


AC000886 


Homo sapiens 
genomic DNA, 21q
region, clone:
64E11X19


7e-14


1363925 


hypothetical protein 2 - North 
American opossum (fragment) 
>gi|S97721 (248955) ORF-2, 
putative RT [Didelphis 
virginiana]


1.1


1413 


Z62664 


H.sapiens CpG DNA, 
clone 71d11, forward
read cpg71d11.ft1a.


7e-14 


3953461 


(AC002328) F20N2.6
[Arabidopsis thaliana]


0.085 


1414 


AB014532 


Homo sapiens mRNA 
for KIAA0632
protein, partial cds 


7e-14 


113668 


!!!! ALU CLASS C WARNING 
ENTRY !!!! 


0.040 



H.sapiens telomeric 
DNA sequence, clone 
20PTEL004, read 
20PTELOO004.seq 



1415 



Z96478 



7e-14



1416 



AF100694 



1417



AF012899



2981631 



(AB012223) ORF2 [Canis 
familiaris] 



Mus musculus 
Pontin52 mRNA, 
complete cds 



4e-14



<NONE> 



<NONE> 



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds



4e-14



<NONE> 



<NONE> 



2e-04 



<NONE> 



<NONE> 



1418 



AF033349 



Homo sapiens MLL 
gene breakpoint 
cluster region, intron 
1, partial sequence



3e-14 



728831 



!!!! ALU SUBFAMILY J
WARNING ENTRY 



1419 



AC001526



1420 



AF012899 



Homo sapiens 
(subclone 4_f6 from 
P1 H54) DNA
sequence



3e-14



99861 



extensin - almond >gi|20420 
(X65718) extensin 



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds



3e-14



728832 



!!!! ALU SUBFAMILY SB
WARNING ENTRY 



1421 



AF100694



Mus musculus
Pontin52 mRNA,
complete cds



2e-14



3913573 



EPHRIN-A2 PRECURSOR
(EPH-RELATED RECEPTOR
TYROSINE KINASE LIGAND
6) (LERK-6) sapiens]
>gi|2924761 (AC004258)
EPL6_HUMAN [Homo sapiens]



9.2 



0.15 



8.7 



1422 



AF012899



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 



9e-15 



119040 



E1B PROTEIN, SMALL
ANTIGEN (E1B 19K)
>gi|74142|pir||Q1AD25 early
E1B 21K protein II - human
adenovirus 5 >gi|58489
(X02996) mRNA 5 first reading
frame [Human adenovirus type
5] adenovirus type 5]
>gi|209797 (J01969) 21 kD
protein



1.5












transcription factor GATA-4, 




1423 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


8e-15


477102


retinoic acid-inducible - mouse
>gi|293345 (M98339) GATA- 
binding transcription factor 
[Mus musculus] 


0.57 


1424 


AB012223


Canis familiaris LINE
1 element ORF2
mRNA, complete cds


8e-15


92385 


hypothetical protein - rat 
(fragment)


0.003 


1425 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


3e-15


<NONE> 


<NONE> 


<NONE> 


1426


X12433


Human pHS1-2
mRNA with ORF
homologous to
membrane receptor
proteins


3e-15


422532 


collagen alpha 3(IV) chain - sea 
urchin 


8.9 


1427 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


3e-15 


1353143 


PROBABLE NUCLEAR
HORMONE RECEPTOR
E02H1.7
>gi|3875431|gnl|PID|e1344980
(Z47075) similar to Zinc finger,
C4 type (two domains)
[Caenorhabditis elegans]


5.0 


1428 


Z69651


Human DNA
sequence from
cosmid L75B9,
Huntington's Disease
Region, chromosome
4p16.3


3e-15 


403460 


(L24521) transformation-related 
protein [Homo sapiens] 


0.60 


1429 


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


2e-15 


108750 


Ig heavy chain precursor 
(B/MT.4A.17.H5.A5) - bovine
>gi|440 (X62916) anti-
testosterone antibody [Bos
taurus] 


1.1


1430 


X83299 


H. sapiens SMA3 
mRNA 


2e-15 


671530 


(X83299) SMA3 gene product 
[Homo sapiens]


0.32 


1431 


U01877 


Human p300 protein 
mRNA, complete cds. 
>gb|I62297|I62297
Sequence 1 from
patent US 5658734


2e-15


3024341 


E1A-ASSOCIATED PROTEIN
P300 


0.019 












HYPOTHETICAL 45.1 KD




1432 


X16516 


Mouse MHC (Qa) Q2-
k gene for class I
antigen, exons 4-8


1e-15


2496897 


PROTEIN LI 6L1U. 6 IN
CHROMOSOME III
>gi|3874384|gnl|PID|e1344078
EST EMBL:C08256 comes
from this gene; cDNA EST
EMBL:C09941 comes from this
gene; cDNA EST yk340a10.3
comes from this gene; cDNA
EST yk340a10.5 comes from
this gene [Ca...


7e-08 


1433 


M74165 


Chicken tensin 
mRNA, complete cds.


1e-15


283920


tensin - chicken >gi|212752
(M74165) tensin


2e-19


1434 


X71393 


H.sapiens gene for
immunoglobulin 
kappa light chain 
variable region 04 
and 05 


9e-16 


<NONE> 


<NONE> 


<NONE> 


1435 


U05227 


Human Rar protein 
mRNA, complete cds. 


9e-16 


3036779 


(ZSS44;y) match: multiple 
proteins; match: 000407 
Q 12829 P22127 P36S61 
Q40219; match: P70550 
Q41022 P22125 Q08155 
P35^86- match: P51 14S P51 147 
P35293 P36861 P352S9; match: 
P35284Q40217 P51I52 
P51157 P51 158; match: Q41022 


3e-06 


1436 


M23404


Chicken erythrocyte
anion transport 
protein (band3) 
mRNA, complete cds. 


9e-16 


726403 


(U23175) similar to anion 
exchange protein 
[Caenorhabditis elegans] 


1e-28


1437 


X16145 


Rat mRNA for liver a- 
L-Fucosidase (EC 
3.2.1.51) 


9e-16 


67502 


alpha-L-fucosidase (EC 
3.2.1.51) I precursor, tissue -
human >gi|178409 (M29877)
alpha-L-fucosidase precursor
(EC 3.2.1.51) [Homo sapiens]


2e-29 


1438


AF012899


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


8e-16


<NONE> 


<NONE> 


<NONE> 


1439 


AF076981


Mus musculus brain
mitochondrial carrier
protein BMCP1
(Bmcp1) mRNA,
complete cds


8e-16


3851540


(AF078544) brain mitochondrial
carrier protein- 1 [Homo sapiens] 


2e-13 









H.sapiens MN/CA9 






!!!! ALU SUBFAMILY J 




1440 


Z54349 


GENE 


5e-16 


728831


WARNING ENTRY 


0.002 


1441 


AF077003 


Mus musculus SH3 
domain-containing 
adapter protein 
mRNA, complete cds


3e-16 


309123 


(M35526) complement 
component C5D [Mus 
musculus] 


3.1 


1442 


X64587 


M. musculus mRNA 
for splicing factor 
U2AF(65kD) 


3e-16 


2143767 


glycoprotein - rat >gi|986943 
(L08134) glycoprotein [Rattus 
norvegicus]


0.003 


1443 


AB014561 


Homo sapiens mRNA
for KIAA0661
protein, complete cds


3e-16


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


1e-20


1444 


Z73987


Human DNA
sequence from
cosmid N120B6 on
chromosome 22
Contains ESTs.
complete sequence
[Homo sapiens]


1e-16


<NONE> 


<NONE> 


<NONE> 


1445 


M5S31S 


Homo sapiens ala 
gene.


1e-16


<NONE> 


<NONE> 


<NONE> 


1446 


U44103 


Human small GTP 
binding protein Rab9 
mRNA, complete cds


1e-16


1552584


(Z80233) hypothetical protein
Rv0029


1.3


1447 


AB014561 


Homo sapiens mRNA 
for KIAA0661
protein, complete cds


9e-17


3327136


(AB014561) KIAA0661 protein
[Homo sapiens] 


2e-20 


1448 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-17


<NONE> 


<NONE> 


<NONE> 


1449 


M76762 


Mus musculus 
ribosomal protein (Ke 
3) gene, exons 1 to 5. 
and complete cds. 


1e-17


1073048


pupR protein - Pseudomonas
putida >gi|525260


0.36 


1450 


D50561 


Human DNA,
replication enhancing
element (REE1)


4e-18 


126295 


LINE-1 REVERSE
TRANSCRIPTASE
HOMOLOG


0.78


1451 


D16431


Human mRNA for
hepatoma-derived
growth factor,
complete cds 


4e-18


3242079 


(AJ0069S4) proline-rich protein 


0.018
















1452 




Mus musculus heat 
shock protein hsp40-3 
mRNA, complete cds


4e-18


3H/J/U/ 


t7"TX \CY)\ Cimilnr-if-w try. D r- 

similarity to B.subtilis
DNAJ protein
(SW:DNAJ_BACSU); cDNA
EST yk437a1.5 comes from this
gene [Caenorhabditis elegans]


9e-25 


1453 


U60205


Human methyl sterol 
oxidase (ERG25) 
mRNA, complete cds


3e-18


<NONE> 


<NONE> 


<NONE> 


1454 


AF038177 


Homo sapiens clone 
ZJSyy mRNA
sequence


1e-18


1360775 


G protein-coupled receptor 74 - 
equine herpesvirus 2 >gi|695246 
(U20824) G protein-coupled 
receptor [Equine herpesvirus 2] 


5.1 


1455 


AB014561 


Homo sapiens mRNA
for KIAA0661
protein, complete cds


1e-18


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


1e-21


1456 


AB014561 


Homo sapiens mRNA 
for KIAA0661 
protein, complete cds 


1e-18


3327136


(AB014561) KIAA0661 protein
[Homo sapiens]


1e-22


1457 


U34374 


Human tyrosine 
kinase TXK (txk) 
gene, exons 9 and 10.


1e-19


<NONE> 


<NONE> 


<NONE> 


1458 


AB006969 


Homo sapiens
hGAA1 mRNA,
complete cds


1e-19


4151809 


(AF102855) synaptic SAPAP- 
interacting protein Synamon 


0.19 


1459 


AB002293 


Human mRNA for 
KIAA0295 gene,
partial cds


1e-19


2224531


(AB002293) KIAA0295 [Homo
sapiens]


6e-17


1460 


Z59664 


H.sapiens CpG DNA, 
clone 16St9. reverse 
read cpgI6SP9.rtla . 


5e-20 


3880251


(Z82055) predicted using
Genefinder


6.5


1461 


M73837 


Human modulator 
recognition factor 2 
(MRF-2) mRNA, 
complete cds. 


5e-20 


284313 


modulator recognition factor 2 -
human factor 2 [Homo sapiens]


0.019
















1462 


U24267 


Human pyrroline-5-
carboxylate
dehydrogenase


5e-20 


2506350 


DELTA-1-PYRROLINE-5-
CARBOXYLATE
DEHYDROGENASE
PRECURSOR (P5C
DEHYDROGENASE)
>gi|1353248 sapiens]
>gi|1353250 (U24267) pyrroline
5-carboxylate dehydrogenase
>gi|1589585|prf||2211355A
Delta 1-pyrroline-5-carboxylate
dehydrogenase [Homo sapiens]


5e-04 




U13262


Mus musculus myelin 
gene expression 
factor 


4e-20 


536926 


(U13262) myelin gene
expression factor [Mus
musculus]


3e-07


1464


U13262


Mus musculus myelin 
gene expression 
factor 


4e-20


5 LZ0iS/5 


(AF061832) M4 protein
deletion mutant [Homo sapiens]


te-uo 


1465 


f * ^ —* r\ 

Z612j9 


H.sapiens CpG DNA. 
clone 4Sf 10, forward 
read cpg4Snu.rtla . 




ioovuvj i 


(D88747) AR401 [Arabidopsis
thaliana]


Ul> 1 7 


1466 


U89915


Mus musculus 
junctional adhesion 

m rsl&t"* 1 tl& ill m I 

inujccuic \j*iiiit 
mRNA, complete cds 


1e-20


3462455 


(U89915) junctional adhesion
molecule [Mus musculus]


7e-11


1467 


AF029071 


Gallus gallus p52 pro-
apototic protein
mRNA, complete cds


7e-22 


2599492 


(AF029071) p52 pro-apototic 
protein [Gallus gallus] 


1e-15


1468 


M25636 


Figure 4. Nucleotide 
sequence of the 
pKS36 1.797 kb 
insert. 


6e-22 


1196398 


(M21305) unknown protein 
[Homo sapiens]


0.65 


1469 


AB020655 


Homo sapiens mRNA 
for KIAA034S 
protein, complete cds 


6e-22 


4240325 


(AB020725) KIAA0913 protein
[Homo sapiens]


1e-19












PROCOLLAGEN ALPHA




1470 


S80935 


chorionic 

gonadotropin beta 1 
(CG beta 1) subunit 


5e-22 


115310 


1(IV) CHAIN PRECURSOR
>gi|84917|pir||A31893 collagen
alpha 1(IV) chain precursor -
fruit fly (Drosophila
melanogaster) melanogaster]
>gi|157078 (M96575) type IV
collagen pro-collagen 
[Drosophila melanogaster] 


0.027 


1471 


AF053066 


Homo sapiens 
microsatellite 
D5S2926 sequence 


2e-22 


728831 


!!!! ALU SUBFAMILY J
WARNING ENTRY 


3e-04 


1472 


U55177 


Danio rerio carbonic 
anhydrase homolog 
CAH-Z mRNA, 
complete cds 


2e-22 


3123190 


CARBONIC ANHYDRASE 
(CARBONATE 

DEHYDRATASE) >gi|2576335 
(U55177) CAH-Z [Danio rerio] 


5e-14 


1473 


AF064250 


Gallus gallus
ubiquitin specific 
protease 66 


2e-22 


2736064 


(AF016107) ubiquitin specific 
protease 41 [Gallus gallus]


7e-37 


1474 


AF030830 


Homo sapiens 
pendrin (PDS) 
mRNA, complete cds 


2e-22 


729367 


DRA PROTEIN (DOWN-
REGULATED IN ADENOMA)
>gi|2135020|pir||A47456 down-
regulated in adenoma (DRA) -
human >gi|291964 (L02785)
Nuclear localization signal at
AA 569-573, 576-580, 579-583;
acidic transcr. activ. domain 620
640,; homeobox motif 653-676
[Homo sapiens]


4e-53 


1475 


AF100694


Mus musculus
Pontin52 mRNA, 
complete cds 


6e-23 


<NONE> 


<NONE> 


<NONE> 


1476 


X57398 


Human mRNA for
pM5 protein 


3e-23 


107350 


pM5 protein - human
>gi|1335273|gnl|PID|e36241


1e-04


1477 


AB010998


Rattus norvegicus 
PAD-Rll mRNA for 
Peptidylarginine 
deiminase type 1. 
complete cds 


2e-23 


<NONE> 


<NONE> 


<NONE> 


1478 


D10871


Human h NAT allele
2-2 gene for
arylamine N-
acetyltransferase


2e-23 


171200 


(J04734) CDC6 protein
[Saccharomyces cerevisiae]


9.S 


1479 


D10871


Human h NAT allele
2-2 gene for
arylamine N-
acetyltransferase


2e-23 


171200 


(J04734) CDC6 protein
[Saccharomyces cerevisiae]


S3 






Homo sapiens MLL- 










1480 


AF024541 


AF4 fusion protein 
mRNA, partial cds


2e-23 


2136142 


serine/proline-rich FEL protein,
splice form 1 - human


1e-20


1481 


L13773 


Human AF-4 mRNA,
complete cds. 


2e-23 


3063962 


(AF031404) MLL-AF4 fusion 
protein [Homo sapiens] 


1e-20


1482 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds 


8e-24 


<NONE> 


<NONE> 


<NONE> 


1483 


U75467 


Drosophila 

melanogaster Rga and 
Atu genes, complete 
cds 


8e-24 


1658503 


(U75467) Atu [Drosophila 
melanogaster] 


2e-37 


1484 


D17076


Human HepG2 partial 
cDNA, clone 
hmd5a09m5 


7e-24 


<NONE> 


<NONE> 


<NONE> 


1485 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


7e-24 


1169643 


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


7e-10


1486


M11167


Human 28S
ribosomal RNA gene.


2e-24 


3875481 


(Z81054) predicted using
Genefinder; Similarity to UDP-
glucoronosyltransferases


5.1 


1487 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


2e-24 


549173 


USP1 PROTEIN PRECURSOR 
>gi|169623


1.2 


1488 


AB003468 


Cloning vector 
pAP3neo DNA, 
complete sequence 


2e-24 


987050 


(X65335) lacZ gene product 
[unidentified cloning vector]


0.053 


1489 


X03541 


Human mRNA of trk
oncogene >
gb|I96186|I96186
Sequence 23 from 
patent US 5734039 


2e-24 


325465 


(M74509) [Human endogenous
retrovirus type C oncovirus
sequence.], gene product [Homo
sapiens]


3e-04 


1490 


L81652 


Homo sapiens 
(subclone 2_g11 from
P1 H43) DNA
sequence 


2e-24 


225047 


reverse transcriptase related 
protein [Homo sapiens] 


4e-12 


1491 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA,
complete cds 


2e-24 


2078282 


(U95760) Sno [Drosophila 
melanogaster]


2e-41


1492 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


8e-25


2623773 


(AF004835) tyrocidine 
synthetase 3 [Brevibacillus 
brevis] 


8.6 



WO0 102568 [file://C:\Pocuments and 



Selngsta^^ 



Page 278 of 1046 



WO 01/02568 



PCT/US00/18374





















1493 


AB002405 


Homo sapiens mRNA 
for LAK-4p, 
complete cds 


8e-25


2496822


HYPOTHETICAL 127.3 KD
PROTEIN B0416.1 IN
CHROMOSOME X >gi|746502
(U23516) B0416.1 gene product
[Caenorhabditis elegans]


9e-11


1494 


K03002 


Human mRNA from 
chromosome 15 gene 
with homology to 
MHC-HLA-SB-1 
intron A. 


8e-25 


1514614 


(X92842) nuclear protein [Mus 
musculus] 


1e-13


1495


U61232 


Human tubulin-
folding cofactor E
mRNA, complete cds


7e-25


1465772


(U61232) cofactor E [Homo
sapiens]


2e-05 


1496 


U10245


Arabidopsis thaliana
Col-0 putative RNA
helicase A mRNA, 
complete cds. 


5e-25 


1353239 


(U10245) putative RNA
helicase A [Arabidopsis
thaliana]


1e-37


1497 


X89211


H. sapiens DNA for 
endogenous retroviral 
like element 


3e-25 


2065210


(Y12713) Pro-Pol-dUTPase
polyprotein


5e-06 


1498 


L81652 


Homo sapiens 
(subclone 2_g11 from
P1 H43) DNA
sequence 


3e-25 


2072961 


(U93568) putative p150 [Homo
sapiens] 


5e-16 


1499 


X82895


H.sapiens mRNA for
DLG2 


2e-25 


2497511 


MAGUK P55 SUBFAMILY
MEMBER 2 (MPP2 PROTEIN)
(DISCS, LARGE HOMOLOG
2)


1e-34


1500 


M36654 


Mouse homeo box 
2.6 (Hox-2.6) mRNA,
complete cds. 


9e-26 


3323169


(AE001255) T. pallidum 
predicted coding region TP0854 


1.9 


1501 


L36315 


Mus musculus (clone 
pMLZ-1) zinc finger 
protein 


9e-26 


1806134 


(Z67747) zinc finger protein 
[Mus musculus]


4e-05 


1502 


AB018281 


Homo sapiens mRNA 
for KIAA0738
protein, complete cds


9e-26


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY


1e-07


1503 


AF017433


Homo sapiens 
putative transcription 
factor CR53 


9e-26 


32I99S5 


ZINC FINGER PROTEIN ZFP- 
29 


Ic-IT 






Homo sapiens 










1504 


AC001225


(subclone 2_e6 from 

Dnv nyty i^/i^rt 

sequence 


8e-26 


2653713 


f U91S°3"> small S Drotein 
[Hepatitis B virus] 


4.3 


1505 


AF100694


Mus musculus
Pontin52 mRNA, 
complete cds 


8e-26 


283446 


cyteine-rich surface antigen 72, 
CRP72 - Giardia lamblia 
(fragment) 


3.4 


1506 


X94912


H. sapiens Pr22 gene 


3e-26 


728837 


!!!! ALU SUBFAMILY SQ
WARNING ENTRY 


4e-09 


1507 


AF100694


Pontin52 mRNA, 
complete cds 


2e-26 


<NONE>


<NONE> 


<NONE> 


1508 


U44103 


Human small GTP
binding protein Rab9 
mRNA, complete cds 


le-26 


3327038 


(AB014512) KIAA0612 protein
[Homo sapiens] 


8.7 


1509 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


9e-27


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


\J. 1 "+ 


1510 


AOUOl 2 12 


Homo sapiens 
genomic DNA, 21q 
region, clone: 
9H1 IN4o 


9e-27




LINE-1 REVERSE
TRANSCRIPTASE
HOMOLOG protein
[Nycticebus coucang]




1511 


AF027131 


Mus musculus mucin 
glycoprotein wulj 
mRNA, partial cds 


9e-27 


2589172


norvegicus]


2e-14


1512 


U49057 


Rattus norvegicus
CTD-binding SR-like
protein rA9 mRNA,
complete cds


5e-27 


1438534 


(U49057) rA9 [Rattus
norvegicus]


1e-04


1513 


J03764 


Human plasminogen
activator inhibitor-1
gene, exons 2 to 9.


3e-27


<NONE> 


<NONE> 


<NONE> 


1514 


Z78160


M. musculus partial
cochlear mRNA
(clone 28D2)


3e-27


1490362


(Z78160) unknown [Mus
musculus] 


2e-05 


1515 


Z64210 


H. sapiens CpG DNA, 
clone 99b4. reverse 
read cpg99b4.rt1a.


3e-27


2257538


(AB004538) LIPOIC ACID
SYNTHETASE
PRECURSOR(LIP-SYN)
[Schizosaccharomyces pombe]


1e-06



1516 L35659



Homo sapiens 
(subclone H8 6_h6 
from PI 35 H5CS) 
DNA sequence. 



1e-27



<NONE> 



<NONE> 



<NONE> 



1517 AF100694



Mus musculus
Pontin52 mRNA,
complete cds



1e-27



1644471 



1518 AF100694 



Mus musculus
Pontin52 mRNA,
complete cds



(U72686) odorant receptor 4
[Danio rerio]



1e-27



2738388 



(AF003534) hypothetical 
protein 004L [Chilo iridescent 
virus]



1519 AB009271



Homo sapiens gene
for BCNT, partial cds



1e-27



3880909 



1520 AF100694 



Mus musculus 
Pontin52 mRNA, 
complete cds 



(AL032636) Y40B1B.3 
[Caenorhabditis elegans] 



1e-27



2133579 



spermatophorin Sp23 - yellow 
mealworm molitor]



7.5 



6.7 



4.6



0.85 



1521 AF100694



Mus musculus
Pontin52 mRNA,
complete cds



1522 AF100694



Mus musculus
Pontin52 mRNA,
complete cds



1e-27



121805 



ENDOGLUCANASE A 
PRECURSOR 



1e-27



3722000



(AF035323) survival motor
neuron protein [Bos taurus]



0.58



0.10 



1523 AF100694



1524 AF074382 



1525 AF100694 



Mus musculus
Pontin52 mRNA,
complete cds



1e-27



3328188



Homo sapiens IkB
kinase gamma subunit



1e-27



3641280 



1526 L78778 



1527 L03427 



1528 U09954



Mus musculus
Pontin52 mRNA,
complete cds



1e-27



4056454 



Homo sapiens 
(subclone 2_el0 from 
P1 H49) DNA
sequence



1e-27



225047 



Human zinc finger 
protein basonuclin 
mRNA, complete cds



1e-27



1483275 



Human ribosomal 
protein L9 gene, 5' 
region and complete 
cds. 



4e-28



2257538 



(AF074902 ) laminin alpha chain 
[Caenorhabditis elegans] 



(AF074382) IkB kinase gamma 
subunit [Homo sapiens]



(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana] 



reverse transcriptase related 
protein [Homo sapiens] 



(U59694) zinc finger protein
basonuclin [Homo sapiens]



(AB004538) LIPOIC ACID
SYNTHETASE 
PRECURSOR(LIP-SYN) 
Schizosaccharomyces pombe] 



0.08 



0.041 



6e-04 



2e-09 



9e-22 



2e-04 
















1529 


Z64210 


H.sapiens CpG DNA, 
clone 99b4. reverse 
read cpg99b4.rt1a.


4e-28 


3878570 


(Z46381) similar to lipoic acid 
synthase; cDNA EST yk283b6.3 
comes from this gene; cDNA 
EST yk283b6.5 comes from this 
gene; cDNA EST yk472f5.3 
comes from this gene; cDNA 
EST yk472f5.5 comes from this 
gene; cDNA EST yk476e7.3... 


7e-11


1530 


U55177 


Danio rerio carbonic 
anhydrase homolog 
CAH-Z mRNA, 
complete cds 


4e-28 


3123190 


CARBONIC ANHYDRASE 
(CARBONATE 
DEHYDRATASE) >gi|2576335 
(U55177) CAH-Z [Danio rerio]


5e-21


1531 


D43632 


Human mRNA for
very-long-chain acyl- 
CoA dehydrogenase 
(VLCAD), complete 
cds 


4e-28 


1351839 


ACYL-COA 

DEHYDROGENASE, VERY- 
LONG-CHAIN SPECIFIC 
PRECURSOR (VLCAD) 
>gi|930358 taurus]


3e-^7 


1532 


AF016591


Homo sapiens
survival motor neuron
pseudogene, complete
sequence


3e-28


728831 


!!!! ALU SUBFAMILY J
WARNING ENTRY 


3e-08 


1533 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


2e-28 


728832 


!!!! ALU SUBFAMILY SB 
WARNING ENTRY 


2.5 


1534 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


2e-28 


118588 


DEHYDRIN DHN3 
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin. DHN3
[Pisum sativum]


0.004 


1535 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


2e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide 
precursor FMRFamide-related 
peptide [Lymnaea stagnalis] 


6e-04 


1536 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


2e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


9e-05 









(AC005990) Contains repeated



1537 



AF100694



Mus musculus 
Pontin52 mRNA, 
complete cds 



1538 



AF100694



Mus musculus 
Pontin52 mRNA, 
complete cds 



2e-28 



4056454 



region with similarity to 
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 



(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana
ESTs gb|Z34165 and gb|Z18788
come from this gene 
[Arabidopsis thaliana] 



2e-28 



4056454 



2e-06 



2e-09 



1539 



AF100694



Mus musculus 
Pontin52 mRNA, 
complete cds 



2e-28 



4056454 



(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]



1e-09



1540 



AF100694



Mus musculus
Pontin52 mRNA, 
complete cds 



2e-28 



4056454 



(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]



1541 



AF100694



Mus musculus
Pontin52 mRNA, 
complete cds 



2e-28 



4056454 



1542 



AF100694 



Mus musculus
Pontin52 mRNA, 
complete cds 



2e-28 



3157926 



Mus musculus
Pontin52 mRNA,



1543 



AF100694 



complete cds 



1e-28



<NONE> 



Mus musculus
Pontin52 mRNA,



1544 



AF100694



complete cds 



1e-28



<NONE> 



1545 



AF100694



Mus musculus
Pontin52 mRNA,
complete cds



(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 



(AC002131) Strong similarity to
extensin-like protein gb|Z34465
from Zea mays. [Arabidopsis
thaliana]



5e-10 



le-11 



8e-12



<NONE>



<NONE> 



<NONE> 



<NONE> 



1e-28



<NONE> 



<NONE> 



<NONE> 









Mus musculus 










1546 


AF100694


Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1547 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1548 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1549 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1550 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


<NONE> 


<NONE> 


<NONE> 


1551 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1552 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1553 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1554 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1555 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1556 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1557 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1558 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1559 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1560 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>


1561 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE>


<NONE>






Mus musculus 










1562 


AF100694 


Pontin52 mRNA, 
complete cds 


1e-28


<NONE> 


<NONE> 


<NONE> 


1563 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1564 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1565 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE>


<NONE> 


<NONE> 


1566 


M37708 


Human simple repeat 
polymorphism. 


1e-28


<NONE> 


<NONE> 


<NONE> 


1567 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1568 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


3924779 


B; cDNA EST yk450d8.5 comes
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes fr...

>gi|3924881|gnl|PID|e1354569
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes from... 


3X) 


1569 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


0.66












u.^VAfUJJjj Milium iu uimmu 




1570 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


3924779


B; cDNA EST yk450d8.5 comes
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes fr...

>gi|3924881|gnl|PID|e1354569
from this gene; cDNA EST
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5
comes from...


0.65 


1571 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


0.49 


1572 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579


spermatophorin Sp23 - yellow
mealworm molitor]


0.49 


1573 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


283446


cysteine-rich surface antigen 72,
CRP72 - Giardia lamblia
(fragment) 


0.45 


1574 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2493937


SPERMATOPHORIN SP23
PRECURSOR mealworm
>gi|161725 (M92928) structural
protein 


0.33 


1575 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1492050


(U60315) MC107L [Molluscum
contagiosum virus subtype 1]


0.18


1576 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


2133579


spermatophorin Sp23 - yellow
mealworm molitor]


0.088


1577 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.018


1578


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.016












1579


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709 
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.012 


1580 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.010 


1581 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709 
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.002 


1582 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


0.002


1583 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.002 


1584 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.002 


1585 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.002 


1586 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.001






WO 01/02568 PCT/US00/18374

















1587


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.001 


1588 


AF100694 


Mus musculus 
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


6e-04 


1589 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


5e-04 


1590 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-04 


1591 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


2e-04 


1592 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-04 


1593 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-05












1594


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


5e-05 


1595 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05


1596 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05


1597 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


9e-06 


1598 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-06 


1599 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana] 


5e-06 


1600


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


544357


RNA-BINDING PROTEIN
FUS/TLS protein [human,
Peptide, 526 aa] [Homo sapiens]


4e-06












1601


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-06 


1602 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-06 


1603 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


9e-07 


1604 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


8e-07 


1605 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


7e-07 


1606 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-07


1607 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-07












1608


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


3e-07 


1609 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-07


1610 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-07


1611


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


7e-08 


1612 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-08 


1613 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-09












1614


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


5e-09 


1615 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


4e-09 


1616 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


7e-10 


1617 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


6e-10 


1618 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


5e-10 


1619 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


4e-10












1620


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-10


1621 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


5e-11


1622 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


2e-12


1623 


AF032896 


Petromyzon marinus 
polyadenylate binding 
protein 


1e-28


1082703


polyadenylate binding protein II 
human 


2e-27 


1624 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


9e-29


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.013 


1625 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-29 


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


6e-04 


1626 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


9e-29


3376465


(Z81071) predicted using
Genefinder; Similarity to
Human small nuclear
ribonucleoprotein E; cDNA EST
yk375g7.5 comes from this
gene; cDNA EST yk435f5.3
comes from this gen...


9e-06 


1627 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


8e-29 


4056454 


(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


2e-06












1628


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-29


728883


ADP-RIBOSYLATION
FACTOR 3 fruit fly (Drosophila
melanogaster) >gi|507234
(L25063) ADP ribosylation 
factor 3 [Drosophila 
melanogaster] 


0.016 


1629 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


4e-29


544357


RNA-BINDING PROTEIN
FUS/TLS protein [human,
Peptide, 526 aa] [Homo sapiens]


2e-07 


1630 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-29


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-08


1631 




Human mRNA for 
very-long-chain acyl- 
CoA dehydrogenase 
(VLCAD), complete
cds


4e-29


1 1 6R9R7


ACYL-COA
DEHYDROGENASE, VERY-
LONG-CHAIN SPECIFIC 
PRECURSOR (VLCAD) 
dehydrogenase precursor - rat 
Acyl-CoA dehydrogenase 


UC"J / 


1632 


Y07660 


M.tuberculosis accBC 
gene 


4e-29 


2113935 


(Z95556) accD1
[Mycobacterium tuberculosis] 


3e-47 


1633 


X55367


Human alpha-satellite
DNA from clone
pTRA-2.


1e-29


<NONE> 


<NONE> 


<NONE> 


1634 


L81866 


Homo sapiens 

(subclone 1 T 1 from 

PI H54) DNA 
sequence 


1e-29


<NONE>


<NONE>


<NONE> 


1635 


S75940


{Alu repeats, clone
52H10} [human,
colonic mucosa,
Genomic, 943 nt]


1e-29


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY


1e-07


1636 


AB001907 


Homo sapiens 
PACE4 gene, exon 13


1e-29


728831


!!!! ALU SUBFAMILY J
WARNING ENTRY 


2e-09 


1637 


AF077003 


Mus musculus SH3 
domain-containing 
adapter protein 
mRNA, complete cds


5e-30


<NONE>


<NONE>


<NONE>












1638


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-30


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


3e-10 


1639 


M27072 


Xenopus laevis 
poly(A)-binding
protein (ABP-EF)
mRNA, complete cds.


4e-30


1352709


POLYADENYLATE-
BINDING PROTEIN
polyadenylate-binding protein -
African clawed frog laevis]


5e-21 


1640 


X58386 


B.taurus mRNA for 
bovine vacuolar 
ATPase subunit A 


2e-30 


2773154 


(AF039573) abscisic acid- and 
stress-inducible protein 


4.3 


1641 


Y07660 


M. tuberculosis accBC 
gene 


1e-30


2113935


(Z95556) accD1
[Mycobacterium tuberculosis] 


4e-47 


1642 


AJ236940 


Sus scrofa mRNA for 
hypothetical protein 
(5': clone 7C4) 


4e-31


4102021 


(AF007561) delta 6-desaturase 
[Borago officinalis] 


7.4 


1643 


AF039400 


Homo sapiens 
calcium-dependent 
chloride channel- 1 
(hCLCA1) mRNA,
complete cds


2e-31


3721912 


(AB017156) gob-5 [Mus 
musculus] 


7e-08 


1644 


L77036 


Homo sapiens 
(subclone 5_d9 from 
PI H19) DNA 
sequence. 


1e-31


461663


BOMBYXIN B-2 HOMOLOG
PRECURSOR silkmoth
>gi|217385|gnl|PID|d1003528
(D13924) Samia bombyxin
homolog B-2 [Samia cynthia]


1.1


1645 


X61971 


H.sapiens mRNA for 
macropain subunit 
delta 


1e-31


296734 


(X61971) macropain subunit 
delta [Homo sapiens] 


3e-06 


1646 


L00016 


human mitochondrial 
trnas and partial 
proteins 4 & 5; 
histidyl-, seryl-, 
leucyl-trna genes;
urf4 and urf5 
(partial). 


5e-32 


4056454 


(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.002 


1647 


M17887 


Human acidic 
ribosomal 
phosphoprotein P2 
mRNA, complete cds.


5e-32


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExt1)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


1e-05









1659


U53446


Human mitogen-
responsive
phosphoprotein DOC-
2 mRNA, complete 
cds. 


6e-34 


3395443 


(AC004683) putative 
ammonium transporter, 3' partial


4.7 


1660 


AF013988


Homo sapiens serine 
protease mRNA, 
complete cds 


4e-34 


2507226


PROTEIN-TYROSINE
PHOSPHATASE EPSILON
PRECURSOR (R-PTP-
EPSILON) >gi|1439605
(U62387) protein tyrosine
phosphatases [Mus musculus] 


3.2 


1661 


U53446 


Human mitogen-
responsive
phosphoprotein DOC-
2 mRNA, complete 
cds. 


2e-34 


104757 


LEP100 protein precursor - 
chicken >gi|212254 gallus]


1.6 


1662 


AJ233632 


Homo sapiens 
endogenous retroviral 
sequence ERV-L pol 
gene, clone ERV-L 
Human6 


2e-34 


3860513 


(AJ233597) reverse 
transcriptase [Mus famulus]


4e-10


1663 


AF086310 


Homo sapiens full 
length insert cDNA 
clone ZD51F08 


8e-35


2947070 


(AC002521) putative Ser/Thr 
protein kinase [Arabidopsis 
thaliana] 


2.3 


1664


X17206


Human mRNA for
LLRep3


3e-35


730652


40S RIBOSOMAL PROTEIN
S2 (STRINGS OF PEARLS
PROTEIN)
>gi|1083158|pir||S50325
ribosomal protein S3 - fruit fly
(Drosophila melanogaster)
melanogaster] >gi|515972
(U01335) ribosomal protein S2


2e-10


1665 


AB011137 


Homo sapiens mRNA 
for KIAA0565 
protein, complete cds


3e-35 


3043654 


(AB011137) KIAA0565 protein
[Homo sapiens]


2e-16


1666 


U62801 


Human protease M 
mRNA, complete cds


2e-35 


3929231 


(AF091247) potassium channel
[Rattus norvegicus]


1.0 


1667 


AF020760 


Homo sapiens serine 
protease (Omi) 
mRNA, complete cds 


1e-35


2738915


(AF020760) serine protease
[Homo sapiens]


9e-U






1668


Z93943


Human DNA
sequence from
cosmid U235H3 on
chromosome X


8e-36


1196432


(M22333) unknown protein
[Homo sapiens]


3e-10


1669 


X06778 


Rabbit 18S rRNA 


7e-36 


118588 


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709
(X63063) pea dehydrin DHN3
[Pisum sativum]


0.011


1670 


AB007962 


Homo sapiens 
mRNA, chromosome 
1 specific transcript
KIAA0493 


3e-36 


3329243 


(AE001350) hypothetical 
protein [Chlamydia trachomatis] 


3.1 


1671 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between markers 
DXS366 and DXS87
on chromosome X


3e-36


141103


HYPOTHETICAL PROTEIN
ORF-1137 mouse


0.038


1672 


Z81014


Human DNA
sequence from
cosmid U65A4,
between markers
DXS366 and DXS87
on chromosome X


3e-36


198651


(M29325) ORF1 [Mus
musculus]


0.006 


1673 


U49082 


Human transporter 
protein (g17) mRNA,
complete cds


3e-36


1840045


(U49082) transporter protein
[Homo sapiens]


2e-15


1674 


J03133 


Human transcription 
factor SP1 mRNA, 3'
end.


3e-36


477133


HF-1 regulatory element binding
protein - rat


2e-31


1675 


AB007934 


Homo sapiens mRNA 
for KIAA0465 
protein, partial cds 


1e-36


3413892


(AB007934) KIAA0465 protein
[Homo sapiens]


4e-37


1676 


M34857 


Mouse Hox-2.5 
mRNA. 


9e-37 


106296 


homeotic protein Hox B9 -
human (fragment)


0.15 


1677 


L35657 


Homo sapiens 
(subclone HS 5_al0 
from PI 35 H5 C8) 
DNA sequence. 


9e-37 


2072960 


(U93568) p40 [Homo sapiens]


3e-05 


1678 


X80240


H.sapiens
endogenous
retrovirus HERV-
KC4 DNA


8e-37


4185944


(Y17833) env protein [Human
endogenous retrovirus K]


1e-15






1679


Z93943


Human DNA
sequence from
cosmid U235H3 on
chromosome X


9e-38


106322


hypothetical protein (L1H 3'
region) - human


4e-13 


1680 


X97303 


H.sapiens mRNA for 
Pt2- 12 protein 


4e-38 


466044 


HYPOTHETICAL ZINC
FINGER PROTEIN ZK686.4
IN CHROMOSOME III
>gi|630780|pir||S44909 ZK686.4
protein - Caenorhabditis elegans
>gi|304346 (L17337) coded for
by C. elegans cDNAs
GenBank:M88869 and T01933;
putative [Caenorhabditis
elegans]


3e-37 


1681 




H.sapiens mRNA for
Sop2p-like protein


3e-38 


3334339 


SOP2-LIKE PROTEIN 


5e-06 


1682 


Z62887 


H.sapiens CpG DNA,
clone 74b6, forward
read cpg74b6.ft1a.


2e-38


1245686


(U53181) F36D4.2 gene
product [Caenorhabditis
elegans]


0.19 


1683 


U35032 


Human endogenous 
retrovirus clone 
c5 11 HERV-H 
multiply spliced 
subgenomic leader,
protease and integrase
region mRNA, partial
cds


1e-38


59977


(Z14310) tripartite fusion
transcript PLA2L [Human 
endogenous retrovirus] 


le-06 


1684 


D86974


Human mRNA for
KIAA0220 gene,
partial cds


1e-38


3337386


(AC002544) Unknown gene
product splice form-2 [Homo
sapiens]


8e-11


1685 


M31013 


Human nonmuscle 
myosin heavy chain 
(NMHC) mRNA, 3'
end.


1e-38


4115748


(AB022023) nonmuscle myosin
heavy chain B


2e-11


1686 


AF006087 


Homo sapiens Arp2/3 
protein complex 
subunit p20-Arc
(ARC20) mRNA, 
complete cds 


4e-39 


<NONE> 


<NONE> 


<NONE> 


1687 


X58374 


D.melanogaster crn
mRNA


4e-39


2655888


(AL009171) 62D9.a
[Drosophila melanogaster]


4e-42


1688


D85815 


Human DNA for 
rhoHP1, complete cds


1e-39


134080


GTP-BINDING PROTEIN
TC10 ras-like protein [Homo
sapiens]


3e-26
















1689 


U49057 


Rattus norvegicus 
CTD-binding SR-like 
protein rA9 mRNA, 
complete cds 


4e-40 


1438534 


(U49057) rA9 [Rattus 
norvegicus]


5e-05 


1690 


Y08999 


H.sapiens mRNA for 
Sop2p-like protein


4e-40 


3334339 


SOP2-LIKE PROTEIN 


9e-08 


1691 


AB002293 


Human mRNA for
KIAA0295 gene, 
partial cds 


4e-40 


2224531 


(AB002293) KIAA0295 [Homo 
sapiens] 


1e-30


1692 


AF086222 


Homo sapiens full
length insert cDNA
clone ZC66E08


1e-40


2829669


DOUBLE-STRANDED RNA-
SPECIFIC EDITASE 1
(DSRNA ADENOSINE
DEAMINASE) (RNA
EDITING ENZYME 1)
>gi|1707502|gnl|PID|e254627
(X99227) double-stranded RNA-
specific editase [Homo sapiens]
editase 1 hRED1-L [Homo
sapiens] >gi|2039_>00 (U76421)
dsRNA adenosine deaminase
DRADA2b [Homo sapiens]


0.61 


1693 


AF044127


Homo sapiens
peroxisomal short-
chain alcohol
dehydrogenase
(SCAD-SRL) mRNA,
complete cds


1e-40


4105190


(AF044127) peroxisomal short-
chain alcohol dehydrogenase 


2e-06 


1694 


U36778 


Mus musculus Sil 
mRNA, complete cds 


1e-40


88608 


SIL protein - human >gi|338088 
(M74558) SIL 


6e-23 


1695 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL 


6e-23 


1696 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL 


5e-23 


1697 


U36778 


Mus musculus Sil 
mRNA, complete cds


1e-40


88608


SIL protein - human >gi|338088
(M74558) SIL


5e-23


1698 


AB018285


Homo sapiens mRNA
for KIAA0742
protein, partial cds


1e-40


3882205


(AB018285) KIAA0742 protein
[Homo sapiens]


6e-31















1699


X75927


M.musculus abc2
mRNA


1e-40


728773


ATP-BINDING CASSETTE
TRANSPORTER 1 ABC1 -
human >gi|495257 (X75926)
abc1 [Mus musculus]


3e-37 


1700 


AF038200 


Homo sapiens clone 
23954 mRNA 
sequence 


5e-41 


3211975 


(AF068195) putative 
glialblastoma cell differentiation- 
related protein [Homo sapiens] 


5e-14 


1701 


U20521 


Human estrogen 
sulfotransferase
(STE) gene, exon 8
and complete cds


4e-41


<NONE>


<NONE> 


<NONE> 


1702 


AF026548 


Homo sapiens 
branched chain alpha- 
ketoacid 

dehydrogenase kinase 
precursor, mRNA, 
nuclear gene 
encoding 
mitochondrial 
protein, complete cds 


2e-41


3182923


[3-METHYL-2- 
OXOBUTANOATE 
DEHYDROGENASE 
(LIPOAMIDE)] KINASE 
PRECURSOR alpha-ketoacid 
dehydrogenase kinase precursor 
[Homo sapiens] 


2e-09 


1703 


Y07660 


M. tuberculosis accBC 
gene 


2e-41


465847


HYPOTHETICAL 66.5 KD
PROTEIN F02A9.5 IN
CHROMOSOME III
>gi|2S054:|pir||S28313
hypothetical protein F02A9.5 -
Caenorhabditis elegans
Genefinder; similar to Propionyl
CoA carboxylase beta chain;
cDNA EST EMBL:M39018
comes from this gene; cDNA
EST EMBL:D28069 comes
from this gene; cDNA EST
EMBL:D28068 comes from this
gene; cDNA EST ... 


3e-?8 


1704 


AG001237 


Homo sapiens 
genomic DNA, 21q 
region, clone: 
9HUN46 


1e-41


106322


hypothetical protein (L1H 3'
region) - human 


5e-09 


1705 


AB007934 


Homo sapiens mRNA 
for KIAA0465 
protein, partial cds 


1e-41


3413892


(AB007934) KIAA0465 protein
[Homo sapiens]


3e-12 


1706 


AF055029 


Homo sapiens clone 
24711 mRNA 
sequence 


5e-42 


3250681 


AL0244S6*> putative protein 


2.2 


1707 


Z49747 


O.cuniculus mRNA
for phospholipase C


5e-42


130227


PHOSPHATIDYLINOSITOL-
4,5-BISPHOSPHATE
PHOSPHODIESTERASE
DELTA 1 (PLC-DELTA-1)
(PHOSPHOLIPASE C-DELTA-
1) (PLC-III) >gi|163538
(M20638) phospholipase C-III
[Bos taurus]


5e-36 


1708 


M93651 


Human set gene, 
complete cds. 


2e-42 


<NONE> 


<NONE> 


<NONE> 


1709 


AP36940 


Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 7C4)


2e-42 


2062403 


(U79010) delta 6 desaturase 
[Borago officinalis] 


8.5 


1710 


J03634 


Human erythroid 
differentiation protein
mRNA 


2e-42 


1708436 


INHIBIN BETA A CHAIN 
PRECURSOR 


2e-10


1711 


AJ2' > 3777 


Mus musculus mRNA
for striatin


6e-43


2494917


STRIATIN

>gi|l495773lgnl|PID|c254158 


2e-32 


1712 


AF016411


Homo sapiens 
potassium channel 
subunit KCNA3.IB 


2e-43 


270S5I4 


(AF016411) KCNA3.IB [Homo
sapiens]


3e-13


1713 


AC001443 


Homo sapiens 
(subclone 2_t 10 from 
BAC 2913 


1e-43


1US14


hypothetical protein 3 - rat
>gi|565S9


2e-06 


1714 


X32S95 


H.sapiens mRNA for 
DLG2 


6e-44 


2497511 


MAGUK P55 SUBFAMILY 
MEMBER 2 (MPP2 PROTEIN)
(DISCS, LARGE HOMOLOG
2) 


6eo2 


1715 


U17077


Human BENE
mRNA, partial cds.


3e-44 


53912 


(X57960) ribosomal protein L7 
[Mus musculus] >gi|554S9


8e-30 


1716 


AJ222700 


Homo sapiens mRNA 
for TSC-22 protein


2e-44


<NONE>


<NONE> 


<NONE> 


1717 


J03634 


Human erythroid 
differentiation protein 
mRNA 


2e-44 


124279 


INHIBIN BETA A CHAIN
PRECURSOR PROTEIN)
(EDF) >gi|87936|pir||B2424S
inhibin beta A chain precursor -
human >gi|lSl947 (J03634)
erythroid differentiation protein 
precursor [Homo sapiens] 
sapiens! 

>gi|22bS50|prl1|160S260B 
inhibin beta A [Homo sapiens] 


0.73 

1718 


AB014518 


Homo sapiens mRNA 
for KIAA0618 
protein, complete cds 


7e-45 


1911548 


(S80864) cytochrome c-like 
polypeptide sapiens) 


1.6 


1719 


X76808 


H.sapiens genomic 
DNA clone d2 


7e-45 


868201 


(U29380) similar to adenylate 
cyclase [Caenorhabditis elegans]


2e-09 


1720 


AB021288 


Homo sapiens mRNA 
for beta 2- 
microglobulin,
complete cds 


2e-45 


2465521 


(U95995) RNA-dependent RNA 
polymerase [Cryptosporidium 
parvum]


0.15 


1721 


X63468 


H.sapiens mRNA for
transcription factor
TFIIE alpha


8e-46


<NONE>


<NONE> 


<NONE> 


1722 


AF019226


Homo sapiens D2-2
mRNA, 3'UTR


7e-46 


<NONE> 


<NONE> 


<NONE> 


1723 


D31764 


Human mRNA for 
KIAA0064 gene, 
complete cds 


2e-46 


3123050 


HYPOTHETICAL PROTEIN 
KIAA0064 


1e-15


1724 


K02774 


Human MHC class II 
HLA-DR-beta-psi 
(DW4/DR4) 
pseudogene, exons
3,4,5,6, clones cosII-
3301 and cosII-80L


1e-46


4185946 


(Y17S34) gag protein [Human
endogenous retrovirus K] 


2e-14 


1725 


X92109 


H. sapiens hcelX gene 


9e-47 


2498185 


BRIDE OF SEVENLESS
PROTEIN PRECURSOR
>gi|1079J66|pir||A47550 bride
of sevenless precursor - fruit fly
(Drosophila virilis) >gi|2902I6
virilis]


1.4 


1726 


X93334 


H.sapiens 

mitochondrial DNA,
complete genome 


8e-47 


128753 


NADH-UBIQUINONE 
OXIDOREDUCTASE CHAIN 
4 >gi|86696|pir||A00435 NADH
dehydrogenase (ubiquinone)


4e-15


1727 


M85145


Human tumor
necrosis factor
receptor, 3' flank.


3e-47 


<NONE> 


<NONE> 


<NONE> 


1728 


X80240 


H.sapiens 
endogenous 
retrovirus HERV- 
KC4 DNA 


3e-47 


4185944 


(Y17S33) env protein [Human 
endogenous retrovirus K]


7e-lS 


1729 


Z63594 


H.sapiens CpG DNA. 
clone S7t9. forward 
readcpgS7l:'9.ftla . 


1e-47


3322743 


(AE001222) T. pallidum 
predicted coding region TP0454 


2.4 






1730


X62295


R.rattus mRNA for
vascular type-1
angiotensin II
receptor


4e-48 


1209756 


(U43629) integral membrane 
protein [Beta vulgaris]


1e-07


1731


M85145 


Human tumor 
necrosis factor 
receptor, 3' flank. 


3e-48 


<NONE>


<NONE> 


<NONE> 


1732 


AB020712 


Homo sapiens mRNA 
for KIAA0905 
protein, complete cds 


4e-49 


4240299 


(AB020712) KIAA0905 protein 
[Homo sapiens]


2e-20 


1733 


AB020712 


Homo sapiens mRNA 
for KIAA0905
protein, complete cds 


3e-49 


4240299 


(AB020712) KIAA0905 protein
[Homo sapiens] 


2e-20 


1734 


X62295 


R.rattus mRNA for
vascular type-1
angiotensin II
receptor


1e-49


1209756 


(U43629) integral membrane 
protein [Beta vulgaris]


7e-12 


1735 


AJ007509 


Homo sapiens mRNA
for E1B-55kDa-
associated protein


1e-49


3319956 


(AJ007509) E1B-55kDa-
associated protein 


4e-24 


1736 


X97303 


H.sapiens mRNA for 
Ptg-12 protein 


1e-49


466044 


FINGER PROTEIN ZK686.4
IN CHROMOSOME III
>gi|6307S0|pir||S44909 ZK686.4
protein - Caenorhabditis elegans
>gi|304346 (X17337) coded for
by C. elegans cDNAs
GenBank:MS8869 and T01933;
putative [Caenorhabditis
elegans]


3e-3* ' 


1737 


AF038404 


Homo sapiens 
homolog of Nedd5
(hNedd5) mRNA,
complete cds 


4e-50 


<NONE> 


<NONE> 


<NONE>


1738 


L43618 


Homo sapiens 
polycystic kidney 
disease (PKD1) gene,
exons 35-42 


4e-50 


90375 S 


(L43619) polycystic kidney 
disease 1 protein [Homo 
sapiens] 


3e-' 


1739 


AF009424 


Homo sapiens clone
22 mRNA, alternative 
splice variant alpha- 1, 
complete cds 


4e-50 


2271473 


(AF009426) clone 22 [Homo
sapiens] 


5c 



monosaccharid transport protein 



1740 



1741 



1744 



1745 



1746 



1747 



1748 



1749 



1750



1751 



L77040 



Homo sapiens
(subclone S_cl 1 from 
PI H22) DNA 
sequence 



2e-50 



99758 



STP4 - Arabidopsis thaliana 
>gi| 16524 (X66857) sugar 
transport protein [Arabidopsis 
thaliana]



6.4 



L35657 



Homo sapiens
(subclone H8 5_al0
from PI 35 H5 CS)
DNA sequence.

Homo sapiens CTG7a
mRNA, partial cds
Bovine mRNA for
p97, partial cds



2e-50 



2072960 



(U93568) p40 [Homo sapiens]



2e-05 



<NONE> 



M22960 



Human protective 
protein mRNA,
complete cds.



1e-50



131081 



polyprotein [Sindbis virus]

LYSOSOMAL PROTECTIVE
PROTEIN PRECURSOR
(CATHEPSIN A)
(CARBOXYPEPTIDASE C)
human >gi|l902S3 (M22960)
protective protein precursor 



9.9 



1e-12



X3601S 



U03495 



H. sapiens mRNA for 
MUF1 protein 

Human transcription 
factor LSF-ID 
mRNA, complete cds



1e-50



1082610 



7e-51 



2136296 



muf1 protein - human
>gi|762953 (XS601S) muf1
[Homo sapiens] 



transcription factor LSF - human 
gi|476099 . 



1e-21



1e-21



AB015344



Homo sapiens
HRIHFB2157
mRNA, partial cds



5e-51



3970874 



(AB015344) HRIHFB2157 
[Homo sapiens] 



2c-3 



M93339 



Human zinc finger 
protein mRNA. 



4e-51



3024110



MYC- ASSOCIATED ZINC 
FINGER PROTEIN sapiens] 



2e-06 



U71363 



Human zinc finger 
protein zfp6 (ZF6)
mRNA, partial cds



4e-51



2689441 



(AC0O36S2) F1S547_1 [Homo 
sapiens]



2e-11



X56932 



H. sapiens mRNA for 
23 kD highly basic 
protein



4e-51



730451 



60S RIBOSOMAL PROTEIN
L13A (23 KD HIGHLY BASIC
PROTEIN)

>gi|?45S97|pir||S29539 basic
protein, 23K - human >gi|2369l
(X56932) 23 kD highly basic 
protein [Homo sapiens] 



Z79054 



H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA21Ell 



2e-51 



<NONE> 



<NONE> 



<NONE> 






Homo sapiens 










1752 




BAF60b gene, partial 
sequence 


JC Ji 






<NONE>


1753 




Sus scrofa mRNA for 
hypothetical protein 
, cione 4Doj 






RIBONUCLEOPROTEIN 
RB97D ribonucleoprotein 
[Drosophila melanogaster]


A 1 
*¥. i 


1754 


AF003693 


Mus musculus 
scaffold protein Pbp1

complete cds 


6e-53 


2197106 


(AF003693) scaffold protein
Pbp1 homolog [Mus musculus]


2e-54 


1755 


M27319 


Human calmodulin 
mRNA, complete cds.


5e-53 


115528 


CALMODULIN
>gi|102408|pir||JC1309
calmodulin - Stylonychia lemnae 
(SGC5)>gi|161195 


0.002 


1756 


M / 4 j 3 j 


Mouse house-keeping 
protein mRNA, 
complete cds. 






house-keeping protein - mouse 
>Si|19jS7 1 


je-jO 


1757 


X92720 


H.sapiens mRNA for 
phosphoenolpyruvate
carboxykinase


6e-54 


2135915 


phosphoenolpyruvate
carboxykinase (GTP) (EC
4.1.1.32) precursor,
mitochondrial - human
carboxykinase (GTP) [Homo
sapiens]


6e-21


1758 


nrUU /oil 


Homo sapiens torsinB 
(DQ1) mRNA, partial
cds 


sc- J4 


Z / OU 11 { 


(AB002405) LAK-4p [Homo 
sapiens]


U — 


1759 


U49507 


Mus musculus 
B6CBA Lisch7 
mRNA, partial cds. 


2e-54 


1236083 


(U49507) Lisch7 [Mus 
musculus] 


3e-27 


1760 


Z73360 


Human DNA 
sequence from 
cosmid 92M18,
BRCA2 gene region

rhrAmAcnmr 1 I "?n 1 

13. 


le-55 


2370371 


(Y14657) hydrophobin
[Pleurotus ostreatus]
>gi|29S2620|gnl|PID|e 1283986 

/\j»«juui / run. ii> lii ujjinjuiii 

[Pleurotus ostreatus]


2.0 


1761 


U83702 


Human cytochrome c 
oxidase subunit VIa
gene, exon 3 and 
complete cds 


8eo6 


2982994 


(AEO0O6S2) hypothetical 
protein [Aquifex aeolicus] 


7.0 


1762 


Y127S1 


Homo sapiens mRNA
for transducin (beta)
like 1 protein


7e-56 


3021409 


(Y127S 0 transducin (beta) like 
1 protein [Homo sapiens]


7e-39 







1763 



1764 



1765 



1768 



1769 



1770 



1771 



1772 



AB020673 



Homo sapiens mRNA
for KIAA0866
protein, complete cds 



1773 



1774 



AJ236932 



Sus scrofa mRNA for 
hypothetical protein 
(5'; clone 4B8)



L06900 



Human dystrophin
gene, intron 1
containing pseudo
exon.



1766 X93334 



1767 AF064856 



AF081484 



H. sapiens 

mitochondrial DNA, 
complete genome 
Rattus sp. 7acomp 
protein mRNA,
complete cds 

Homo sapiens alpha- 
tubulin isoform 1 
mRNA, complete cds 



X71427 



AF013988



U25691 



X93334 



D3S255 



U25691 



Homo sapiens mRNA 
for FUS-CHOP 
protein fusion 



Homo sapiens serine 
protease mRNA,
complete cds 



Mus musculus 
lymphocyte specific 
helicase mRNA,
complete cds 



H. sapiens 

mitochondrial DNA, 
complete genome 



Homo sapiens mRNA 
for CAB I, complete 
cds 



Mus musculus 
lymphocyte specific 
helicase mRNA,
complete cds 



8e-57 



3e-57 



1e-58



9e-59 



3e-59 



2e-59 



1e-60



1e-60



7e-61



4e-61 



4e-61



8e-62 



2104553 



(AF001548) Myosin heavy
chain (MHY11) (5'partial)
[Homo sapiens]



400927 



RIBONUCLEOPROTEIN 
RB97D ribonucleoprotein 
[Drosophila melanogaster]



4185129 



(AC005724) unknown protein 
[Arabidopsis thaliana]



1492050 



3169626 



32015 



(U60315) MC107L [Molluscum 
contagiosum virus subtype 1] 

(AF064856) "acomp protein 
[Rattus sp.]



(X06956) alpha-tubulin [Homo 
sapiens]



746557 



2564316 



2137490 



70656 



2135214 



2137490 



(U23523) histidine-rich 
[Caenorhabditis elegans]



(AB006622) No similarities to 
any reported proteins [Homo 
sapiens] 



lymphocyte specific helicase 
mouse musculus] 



ubiquitin/ ribosomal protein 
S27a - human extension protein 
HUBCEPS0 [human. Peptide, 
156 aa] ubiquitin extention 
protein [Cavia porcellus]



gene MLN 64 protein - human 



lymphocyte specific helicase - 
mouse musculus] 



4e-04 



4.7 



7.0 



2e-31



4e-22 



0.45 



0.26 



3e-25 



9e-0S 



4e-23 



Se-26 


















M21731 


Human lipocortin-V 
mRNA, complete cds.


6e-62 


3212603 


Human Annexin V With Proline 
Substitution By Thioproline 


2e-20 


1776 


AF021936


Rattus norvegicus
myotonic dystrophy
kinase-related Cdc42-
binding kinase 
MRCK-beta (MRCK- 
beta) mRNA, 
complete cds 


2e-62 


2736153 


(AF021936) myotonic
dystrophy kinase-related Cdc42- 
binding kinase MRCK-beta 
[Rattus norvegicus] 


3e-27 


1777 


Y12059


H.sapiens HUNK1
mRNA


1e-62


3184498 


(AC004798) R31546_1 [Homo
sapiens] 


3e-09 


1778


L37368 


Human (clone E5.1) 
RNA-binding protein 
mRNA, complete cds.


6e-63 


477578 


sialidase - Actinomyces viscosus 
>gi|141852


7.8 


1779 


M27S77 


Figure 2. Nucleotide
and translated protein
sequences of HPF1, -
2, and -9. 


5e-63 


1731443 


ZINC FINGER PROTEIN S3 
(ZINC FINGER PROTEIN 
HPF1) >gi|l06023|pir||A3289t
finger protein U placental - 
human 


3e-33 


1780 


AF09544S 


Homo sapiens 
putative G protein- 
coupled receptor 


2e-63 


3116131 


(AL0232S8) hypothetical 
protein 


4.6 


1781 


L19437


Human transaldolase 
mRNA containing 
transposable element, 
complete cds 


2e-63 


1^^3l 19 


(U63159) transaldolase [Mus 
musculus]


4e-lS 


1782 


L41351 


Homo sapiens 
prostasin mRNA,
complete cds


1e-63


2833277 


PROSTASIN PRECURSOR 
precursor - human >gi|862305 
(L41351) prostasin [Homo
sapiens] >gi|1143194 (U33446)
prostasin [Homo sapiens]


6e-14


1783 


AF053470 


Homo sapiens 10kD
protein (BC10)
mRNA, complete cds


6e-64 


482237 


hypothetical protein K03H1.9 - 
Caenorhabditis elegans


0.029 



















1784 


D3779I 


Mouse mRNA for
beta-1,4-
galactosyltransferase


6e-64 


• 

3880102 


IZVJ>390J sirninrro fyve zinc 

finger; cDNA EST yk265b4.5 
comes from this gene; cDNA 
EST yk359g9.5 comes from this
gene; cDNA EST yk319c2.5 
comes from this gene 
[Caenorhabditis elegans] zinc
finger; cDNA EST yk265b4.5
comes from this gene; cDNA
EST yk359g9.5 comes from this
gene; cDNA EST yk319c2.5
comes from this gene
[Caenorhabditis elegans]


3e-16 


1785 


AF015770


Mus musculus radical 
fringe (radical- fringe) 
mRNA, complete cds 


6e-64 


2204355 


(U94350) radical fringe 
precursor [Mus musculus] 


1e-36


1786 


Z79054 


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA21Ell 


2e-64 


<NONE> 


<NONE> 


<NONE> 


1787 


M83094 


Homo sapiens 
cytosolic selenium- 
dependent glutathione 
peroxidase gene, 
complete cds, and 
rhoh12 gene, 3' end.


1e-64


2447063 


(U42580) A565R [Paramecium 
bursaria Chlorella virus 1]


8.8 


1788 


Y10211


H.sapiens LAG-3
gene, promoter region


7e-65 


1944540 


(X14112) tegument protein
[human herpesvirus 1] 


2.3 


1789 


M19045


Human lysozyme 
mRNA, complete cds. 


2e-65 


<NONE> 


<NONE> 


<NONE> 


1790 


U01SS2 


Homo sapiens SS- 
A/Ro autoantigen 52
kda component gene, 
complete cds 


2e-65 


585401 


LIPASE MODULATOR
PRECURSOR (LIPASE
HELPER PROTEIN)
>gi|480045|pir||S36249 lipB
protein - Pseudomonas glumae 
>gi|49207 (X70354) helper 
protein 


4.2 


1791 


AF069517 


Homo sapiens RNA 
binding protein DEF- 
3 mRNA, complete
cds 


2e-65 


3212101 


(AF069517) RNA binding 
protein DEF-3 [Homo sapiens] 


1e-25






1792


AF004715


Homo sapiens jerky
gene product
homolog mRNA,
complete cds


2e-65 


2314829 


(AF004715) jerky gene product
homolog [Homo sapiens]


2e-45 


1793 


X59652 


C. longicaudatus hprt 
mRNA for 
hypoxanthine


3e-66 


631625 


hypoxanthine (guanine)
phosphoribosyltransferase - long-
tailed hamster
phosphoribosyltransferase
[Cricetulus longicaudatus]


6e-54


1794 


U94350 


Mus musculus radical
fringe precursor
mRNA, complete cds


3e-67 


2204355 


(U94350) radical fringe 
precursor [Mus musculus] 


2e-33 


1795 


AF015811


Mus musculus
putative

lysophosphatidic acid
acyltransferase
mRNA, complete cds


3e-68 


2317725 


(AF015811) putative
lysophosphatidic acid
acyltransferase [Mus musculus]


7e-51 


1796 


J03137


Cow 

phosphoinositide- 
specific 

phospholipase C 


3e-69 


226908 


phospholipase C 154 [Bos 
taurus] 


3e-25 


1797 


AF044574 


Rattus norvegicus 
putative peroxisomal 
2,4-dienoyl-CoA
reductase (DCR-
AKL) mRNA,
complete cds


1e-69


4105269 


(AF044574) putative 
peroxisomal 2,4-dienoyl-CoA 
reductase [Rattus norvegicus]


2e-33 


1798 


AF015811


Mus musculus
putative

lysophosphatidic acid
acyltransferase
mRNA, complete cds


4e-70 


2317725 


(AF015811) putative
lysophosphatidic acid
acyltransferase [Mus musculus]


3e-19


1799 


X65157 


M.musculus mRNA 
for desmoyokin,
partial 


5e-74 


109781 


desmoyokin - mouse (fragment)
>gi|50675


9e-37 


1800 


Z97207 


Mus musculus mRNA 
for B-IND1 protein 


2e-74 


2231019 


(Z97207) B-IND1 protein [Mus 
musculus]


6e-21


1801 


U27196 


Gallus gallus zinc
finger protein (Fif-l)
mRNA, complete cds.


6e-75 


984814 


(U27196) zinc finger protein 
[Gallus gallus] gallus] 


2e-44 












1802


Y15054


Rattus norvegicus
mRNA for 70 kDa
tumor specific
antigen, partial


3e-77


3123027


70 KD WD-REPEAT TUMOR-
SPECIFIC ANTIGEN
>gi|2505957|gnl|PID|e353992
(Y15054) 70 kD tumor-specific
antigen [Rattus norvegicus]


4e-42


1803 


X65157 


M.musculus mRNA 
for desmoyokin,
partial 


3e-79 


109781 


desmoyokin - mouse (fragment) 
>gi|50675 


9e-33 


1804 


U50736 


Rattus norvegicus 
cardiac adriamycin
responsive protein
mRNA, complete cds


2e-84 


1362781 


cytokine inducible nuclear 
protein C193 - human
>gi|793841 (XS3703) nuclear
protein [Homo sapiens] 


7e-30 


1805 


AF072865 


Rattus norvegicus 
thioredoxin reductase
(TrxR2) mRNA,
nuclear gene 
encoding 
mitochondrial 
protein, complete cds 


2e-84 


3757888 


(AF072865) thioredoxin
reductase [Rattus norvegicus] 


6e-43 


1806 


AF044574 


Rattus norvegicus 
putative peroxisomal 
2,4-dienoyl-CoA 
reductase (DCR- 
AKL) mRNA,
complete cds


6e-85


4105269 


(AF044574) putative 
peroxisomal 2,4-dienoyl-CoA 
reductase [Rattus norvegicus] 


1e-41


1807 


U19181


Rattus norvegicus
Rabin3 mRNA,
complete cds.


2e-87


624225


(U19181) Rabin3 [Rattus
norvegicus]


2e-41


1808 


U40342 


Mus musculus ninein 
mRNA, complete cds.


1e-91


1113865 


(U40342) ninein [Mus 
musculus] 


2e-36 


1809 


X67877 


R. norvegicus mRNA 
for cytosolic 
resiniferatoxin- 
binding protein


4e-92 


136077 


TROPOMYOSIN BETA 3, 
FIBROBLAST chicken 
>gi|5l5694 (M23082) 
tropomyosin [Gallus gallus]


0.56 


1810 


AF044574


Rattus norvegicus
putative peroxisomal
2,4-dienoyl-CoA
reductase (DCR-
AKL) mRNA,
complete cds 


5e-93 


4105269 


(AF044574) putative 
peroxisomal 2,4-dienoyl-CoA
reductase [Rattus norvegicus]


ieoO 


1811


AF035527 


Mus musculus EHF 
(Ehf) mRNA,
complete cds 


2e-95 


3138930 


(AF035527) EHF [Mus
musculus] 


2e-47



1812 AB016930






Cricetulus griseus 
mRNA for
Phosphatidylglycerop 
hosphate synthase, 
complete cds 



6e-96 



4159682 



(AB016930) 

Phosphatidyl glycerophosphate 
synthase [Cricetulus griseus] 



7e-41



1813 AB005549



Rattus norvegicus 
mRNA for atypical 
PKC specific binding 
protein, complete cds 



7e-97 



3868778 



(AB005549) atypical PKC 
specific binding protein [Rattus 
norvegicus] 



3e-41



1814 X90849 



1815 S79873 



1816 U67203 



1817 L14684 



G.gallus PB1 gene



2e-97 



h-lamp-2=lysosome- 
associated membrane 
protein-2 protein-2b 
(LAMP2) mRNA, 
alternatively spliced 
form h-lamp-2b, 
complete cds. 



2134381 



polybromo 1 protein - chicken 
chicken >gi|95I23l (X90849) 
polybromo 1 protein [Gallus 
gallus] 



1e-34



3e-98 



<NONE> 



<NONE> 



Mus musculus ACF7 
neural isoform 1 
(mACF7) mRNA, 
partial cds 



<NONE> 



2e-98 



Rattus norvegicus 
nuclear-encoded 
mitochondrial 
elongation factor G 
mRNA, complete cds 



1818 X84692



M. musculus Spnr 
mRNA for RNA 
binding protein 



1819 U50736



1820 S66855 



Rattus norvegicus 
cardiac adriamycin
responsive protein
mRNA, complete cds



HoxB9=Hox-2.5
[mice, embryos,
mRNA Partial, 786
nt]



1675224 



(U67204) ACF7 neural isoform
2 [Mus musculus] 



9e-39 



e-100 



585084 



e-133 



1363238 



e-113



1362781 



e-107 



1708355 



ELONGATION FACTOR G, 
MITOCHONDRIAL 
PRECURSOR (MEF-G) 
>gi|543383|pir||S40780 
translation elongation factor G, 
mitochondrial - rat >gi|3l0I02



2e-30 



spermatid perinuclear RNA- 
binding protein Spnr - mouse
>gi|673454 (X84692) spermatid 
perinuclear RNA binding 
protein [Mus musculus] 



5e-35 



cytokine inducible nuclear 
protein C193 - human 
>gi|793841 (XS3703) nuclear 
protein [Homo sapiens] 



2e-36 



HOMEOBOX PROTEIN HOX- 
B9 (HOX-2.5) 



Se-37 









1821


S66855


HoxB9=Hox-2.5
[mice, embryos,
mRNA Partial, 786
nt]


e-108


1708355


HOMEOBOX PROTEIN HOX-
B9 (HOX-2.5)


4e-37 


1822 


U92072 


Rattus norvegicus m- 
tomosyn mRNA, 
complete cds 


e-102 


3790389 


(U92072) m-tomosyn [Rattus 
norvegicus]


2e-38 


1823 


D17577


Mouse mRNA for
kinesin-like protein
(Kif1b), complete cds


e-129


2497524


KINESIN-LIKE PROTEIN
KIF1B mouse

>gi|407339|gnl|PID|d1005029
(D17577) Kif1b [Mus
musculus]


2e-39 


1824 


AF062484 


Mus musculus SDP8
mRNA, complete cds 


e-122 


3126981 


(AF062484) SDP8 [Mus 
musculus]


5e-40 


1825 


X73683 


R. norvegicus mRNA 
for histone H3.3


e-109


122075 


(H3.3Q) histone H3.3 - fruit fly 
(Drosophila melanogaster) 
histone H3.3B - chicken 
>gi|2ii9023!pir||S6I2lS histone 
H3.3 - fruit fly (Drosophila 
hydei) I-I36 1 ! [Oryctolagus 
cuniculus] >gii8046 (X53S22) 
Histone H3.3Q gene product 
[Drosophila melanogaster] 
>gij5H9S gallus] >gijl61190 
(M17876) histone H3 [Spisula 
solidissima] >gi|2 1 1 S53 
(MU393) histone 3.3 [Gallus 
gallus] >gi|30684S (MI 1354) 
H3.3 histone [Homo sapiens] 
melanogaster] >gi|96303 I 
(XSI205) histone H3.3 H3.3A
variant [Drosophila
melanogaster] musculus]


2e-40 


1826 


U67203 


Mus musculus ACF7 
neural isoform 1 
(mACF7) mRNA, 
partial cds 


e-102 


1675224 


(U67204) ACF7 neural isoform 
2 [Mus musculus]


2e-40 


1827 


D17577


Mouse mRNA for
kinesin-like protein
(Kif1b), complete cds


e-13! 


2497524 


KINESIN-LIKE PROTEIN 
KIF1B mouse

>gi|407339|gnl|PID|d1005029
(D17577) Kif1b [Mus
musculus] 


7e-42 



















1828 


AB016930


Cricetulus griseus
mRNA for

Phosphatidylglycerop 
hosphate synthase, 
complete cds 


e-131 


4159682 


(AB016930)

Phosphatidyl glycerophosphate 
synthase [Cricetulus griseus] 


3e-43 


1829 


U09874 


Mus musculus SKD3 
mRNA, complete cds.


e-122


2493735 


SKD3 PROTEIN SKD3 [Mus 
musculus] 


7e-48 


1830 


X99145 


C.familiaris mRNA
for C3VS protein


e-110 


1429314 


(X99145) overexpressed in
thyroid tissue after TSH 
stimulation [Canis familiaris] 


2e-49 


1831 


X99836 


P.walti mRNA for 
rnp associated protein 
55 


e-106 


4200286 


(X99836) rap55 [Pleurodeles
waltl]


2e-50 


1832 


AF077003 


Mus musculus SH3 
domain-containing 
adapter protein 
mRNA, complete cds


e-121 


3550240 


(AF077003) SH3 domain- 
containing adapter protein; 
CD2AP 


3e-51 


1833 


AF060246 


Mus musculus strain
C57BL/6 zinc finger
protein 106 (Zfp106)
mRNA, H3a-a allele,
complete cds


e-118


3372657 


(AF060246) zinc finger protein 
106 [Mus musculus]


1e-52


1834 


Z14030


R.norvegicus mRNA
for TRAP-complex
gamma subunit. 


e-120 


1174453


TRANSLOCON-
ASSOCIATED PROTEIN,
GAMMA SUBUNIT (TRAP-
GAMMA) (SIGNAL
SEQUENCE RECEPTOR
GAMMA SUBUNIT) (SSR-
GAMMA)

>gi|423l85|pir||S33294
translocon-associated protein
gamma chain - rat norvegicus]


7eo4 


1835 


AF077003 


Mus musculus SH3
domain-containing
adapter protein
mRNA, complete cds


e-132 


3550240 


(AF077003) SH3 domain-
containing adapter protein; 
CD2AP 


5e-54 


1836 


L20427 


Rattus norvegicus

dihydroxypolyprenylb 

enzoate 

methyltransferase 
mRNA. complete cds 


e-116 


457372 


(L20427)

dihydroxypolyprenylbenzoate
methyltransferase

dihydroxypolyprenylbenzoate
methyltransferase [Rattus
norvegicus]


4e-56 















1837


XS0169


M.musculus mRNA
for 200 kD protein


e-122


1717793


PROTEIN TSG24 (MEIOTIC
CHECK POINT
REGULATOR)
>gi|1083553|pir||A55117 tsg24


2e-56 


1838 


AF080568 


Rattus norvegicus 

CTP:phosphoethanol

amine

cytidylyltransferase
mRNA, complete cds


e-119 


3396102 


(AF080568) 

CTP:phosphoethanolamine 
cytidylyltransferase


6eo8 


1839 


X99145


C.familiaris mRNA
for C3VS protein


e-121 


1429314 


(X99145) overexpressed in 
thyroid tissue after TSH 
stimulation [Canis familiaris]


2e-53 


1840 


AF019075


Pan troglodytes breast
and ovarian cancer
susceptibility
(BRCA1) gene,
partial cds 


e-145 


2213154 


(AF005068) breast and ovarian
cancer susceptibility protein 
splice variant [Homo sapiens] 


leoS 


1841 


U55042 


Bos taurus myosin X,
complete cds 


e-122 


1755049 


(U55042) myosin X [Bos
taurus]


1e-61


1842 


AJ0077S0 


Mus musculus mRNA 
for poly(ADP-ribose) 
polymerase-2


e-119 


3283975 


(AF072521) poly-(ADP-ribosyl)-
transferase homolog PARP


4e-62 


1843


AF072865


Rattus norvegicus
thioredoxin reductase 
(TrxR2) mRNA, 
nuclear gene 
encoding 
mitochondrial 
protein, complete cds 


e-105 


3757888 


(AF072865) thioredoxin 
reductase [Rattus norvegicus]


3e-62 


1844 


U55042 


Bos taurus myosin X, 
complete cds 


e-121


1755049 


(U55042) myosin X [Bos 
taurus] 


1e-62


1845 


X61506 


Mouse E46 mRNA 
for E46 protein 


e-139 


114909 


BRAIN PROTEIN E46 


9e-67 


1846 


D90335 


Bovine mRNA for 
GTP-binding protein 
alpha-subunit


e-L4S 


585174


GUANINE NUCLEOTIDE-
BINDING PROTEIN, ALPHA-
14 SUBUNIT (GL1)
>gi 1 1 087 1 1 !piri| A4089 1 GTP-
binding protein GL1 alpha chain
- bovine protein, alpha-subunit
[Bos taurus]


2e-69 


1847 


U49507 


Mus musculus 
B6CBA Lisch7 
mRNA, partial cds.


e-140 


2121326 


(AC00212S) Lisch7 [Homo
sapiens] 


2e-74 
Table 4


Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ
ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


1


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


2


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


3


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


4


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


5


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


6


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


7


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


8


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


9


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


10


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


11


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


12


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


13


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


14


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


15


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


16


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


17


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


18


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


19


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


20


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


21


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


22


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>


<NONE>












lMDC I protein - crab-eating 




23


<NONE> 


<NONE> 


<NONE>


1079469 


macaque 


9.J 












(AB01 1 IjS) KIAAU^ot) protein 




24


<NONE> 


<NONE> 


<NONE> 


304j6o6 


[Homo sapiens] 














potassium channel protein k\j - 






<NONE> 


<NONE> 


<NONE> 




rat protein [Rattus norvegicus]


vS.O 












(AF091^63j oltactorv receptor 




26 


<NONE> 


<NONE> 


<NONE>


3769624


[Rattus norvegicus]


1 .1 












(Z81517) F2SB1.6 




27 


<NONE> 


<NONE>


<NONE>


3S76443 


[Caenorhabditis elegans]


7.1 












(AB001684) ORF249 [Chlorella




28


<NONE> 


<NONE> 


<NONE> 


2224464 


vulgaris] 


0." 












(Uo/940) ORrvegluo; random 














Cl/iNA ScuuerKc [i-^iLi\UiLcnuii. 






<NONE> 


<NONE> 


<NONE> 


1519707 


discoideum]


6.7 












protein kinase C II [Xenopus 




30 


<NONE> 


<NONE> 


<NONE>


227491 


laevis] 


67 












C50C3.4 protein - 




31 


<NONE> 


<NONE> 


<NONE> 


630575 


Caenorhabditis elegans


6.0 












35 KD PROTEIN IN RNA2 














clover necrotic mosaic virus 














>v;ii&1466 iXOSOJ H ORF for 35 














kDa polypeptide (AA 1-317)














[Red clover necrotic mosaic 






<NONE> 


<NONE> 


<NONE>


157290 


virus]


6.0 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(X16711) pid:g30041 [Homo




33 


<NONE> 


<NONE> 


<NONE> 


30041 


sapiens]


5.9 


34 


<NONE> 


<NONE> 


<NONE> 


2493585 


CELL DIVISION PROTEIN 
FTSW 


5.7 


35 


<NONE> 


<NONE> 


<NONE> 


1001450 


(D63999) hypothetical protein 


5.7 


36 








3182918 


NITROGEN REGULATORY 
PROTEIN AREA 


5.2 


37 




<NONE>




14001 1 


mitochondrial 

RIBOSOMAL PROTEIN S5 
Emericella nidulans 
mitochondrion (SGC3) 
>gi|12709 nidulans] >gi|472822 
fjon90> unknown Drotein 


43 


38 


<NONE> 


<NONE> 


<NONE> 


3979943 


(ALU34J9JJ predicted using 
Genefinder; similar to WD 
domain, G-beta repeat; cDNA 
EST yk362f7.5 comes from this 
gene; cDNA EST yk362f7.3 
comes from this gene 
[Caenorhabditis elegans]


4.0 


39 








950203 


(U31329) polyketide synthase 
[Aspergillus terreus]


3.3 


40 


<NONE> 


<NONE> 


<NONE> 


3560232 


(AL031530) hypothetical zinc
finger protein

[Schizosaccharomyces pombe]


3.0 


41 


<NONE> 


<NONE> 


<NONE> 


730071 


AXONEME-ASSOCIATED 
PROTEIN MST 101(1) product 
[Drosophila hydei] 


2.6 


42 


<NONE> 


<NONE> 


<NONE> 


2506641 


HYPOTHETICAL 21.7 KD 
PROTEIN IN INTE-PIN 
INTERGENIC REGION 
>gi|1787402 (AE000214) orf,
hypothetical protein
[Escherichia coli]


2.5 


43 


<NONE> 


<NONE> 


<NONE> 


3511232 


(AF071556) anthranilate 
dioxygenase large subunit 


2.4 


44 


<NONE> 


<NONE> 


<NONE> 


1150900


(U43139) envelope glycoprotein 
gp120 [Human

immunodeficiency virus type 1] 


1.9 


45 


<NONE> 


<NONE>


<NONE> 


3876099 


(Z75536) similar to dynein
heavy chain; cDNA EST
EMBL:D27549 comes from this
gene; cDNA EST
EMBL:D34859 comes from this
gene [Caenorhabditis elegans]


1.4 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AL032647) predicted using 




46 


<NONE> 


<NONE> 


<NONE> 


3881150 


Genefinder 


1.4 


A 

47 


<NONE> 


<NONE> 


<NONE> 


132200 


COLANIC ACID CAPSULAR 
BIOSYNTHESIS 
ACTIVATION PROTEIN A 
>gi|95605|pir||Sl770l rcsA 
protein 


1.1 


48 


<NONE> 


<NONE> 


<NONE> 


2204286 


(U61380) germination protein 
[Bacillus megaterium]


1.0 


49 


<NONE> 



<NONE> 



<NONE> 


1723955 


HYPOTHETICAL 11.4 KD
PROTEIN IN FOX I -KJEX I
INTERGENIC REGION
>gi|2132566|pir||S64222
probable membrane protein
YGL204c - yeast
(Saccharomyces cerevisiae)
>gi|1322838|gnl|PID|e243803
(Z72726) ORF YGL204c
[Saccharomyces cerevisiae]


0.84 


50 


<NONE> 


<NONE>


<NONE> 


3201564 


(AJ006514) prolipoprotein
diacylglyceryl transferase 
[Vibrio cholerae] 


0.31 


51 


<NONE> 


<NONE> 


<NONE> 


2808721 


(AL0CI42S) hypothetical 
protein Rv0064 


0.27 


52 


<NONE> 


<NONE> 


<NONE> 


602434 


(U17986) GABA/noradrenaline
transporter [Homo sapiens] 


0.13


53 


<NONE> 


<NONE> 


<NONE> 


3347955 


(AF0761S4) cytosolic sorting 
protein PACS-1b [Rattus
norvegicus] 


0.12


54 


<NONE> 


<NONE> 


<NONE> 


1255S87 


(Udjj44) coded for by C.
elegans cDNA yk92b4.5; coded
for by C. elegans cDNA
yk73a1.5; coded for by C.
elegans cDNA yk102e9.5;
coded for by C. elegans cDNA
yk71c8.5; coded for by C.
elegans cDNA yk66d11.5;
coded for by C. elegans cDNA
yk66c3...


0.074 


55 


<NONE> 


<NONE> 


<NONE> 


103076 


Bkm-like sex-determining
region hypothetical protein
CS314 - fruit fly (Drosophila
melanogaster)


0.005 


56 


<NONE> 


<NONE> 


<NONE> 


107560 


Ras inhibitor (clone IC265) - 
human sapiens] 


0.002 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












Bkm-like sex-determining




57 


<NONE> 


<NONE> 


<NONE> 


103076 


region hypothetical protein 
CS314 - fruit fly (Drosophila
melanogaster) 


2e-04 


58 


<NONE> 


<NONE> 


<NONE> 


2702370 


(AF038604) contains similarity 
to Drosophila ovarian tumor 
locus protein (GB:X13693) 
[Caenorhabditis elegans] 


6e-05 


59 


<NONE> 


<NONE> 


<NONE> 


3859713 


(AL033501) phox domain 
protein [Candida albicans] 


3e-05 


60 


<NONE> 


<NONE> 


<NONE> 


2088839 


(AF003386) F59E12.5 gene 
product [Caenorhabditis 
elegans] 


2e-08 


61 


<NONE> 


<NONE> 


<NONE> 


121059 


GC-RICH SEQUENCE DNA-
BINDING FACTOR GCF -
human >gi|179412 (M29204)
DNA-binding factor [Homo 
sapiens] 


4e-09 


62 


<NONE> 


<NONE> 


<NONE> 


3875246 


(Z814yU) similar to WD
domain, G-beta repeats (2
domains); cDNA EST
EMBL:T00482 comes from this
gene; cDNA EST
EMBL:T00923 comes from this 
gene; cDNA EST yk449d4.3 
comes from this gene; cDNA 
EST yk449d4.5 comes from this 
sen . 


9e-24 


63 


<NONE> 


<NONE> 


<NONE> 


1465834 


(U64857) No definition line
found [Caenorhabditis elegans] 


9e-28 


64 


<NONE> 


<NONE> 


<NONE> 


3327136 


(AB014561) KIAA0661 protein
[Homo sapiens] 


1e-29


65 


<NONE> 


<NONE> 


<NONE> 


3880433 


(Z66521) similar to
mitochondrial RNA splicing 
MSR4 like protein; cDNA EST 
EMBL:C09217 comes from this 
gene [Caenorhabditis elegans] 


8e-31 


66 


D42I33 


Rat annexin V gene,
exon7 and exon8 


5.0 


<NONE> 


<NONE> 


<NONE> 


67 


L35679 


Homo sapiens
(subclone H8 2_dll
from PI 35 H5 C8)
DNA sequence.


5.0 


10S6902 


(U412/S) coded for by C.
elegans cDNA yk79g8.5; coded
for by C. elegans cDNA
cm10c8; coded for by C. elegans
cDNA yk79g8.3; similar to
leucine-rich repeats found in
many proteins [Caenorhabditis
elegans]


6.6 



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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






HIV-1 strain BX220










68 


U90184 


from USA, envelope 
glycoprotein C2V3 
region (env) gene, 
partial cds 


5.0 


1297070 


(Z71986) convicilin precursor
[Vicia narbonensis]


6.6 


69 


U61465 


Human myosin VIIa
(MYO7A) gene. 5'
exon 37 


5.0 


2313225 


(AE000535) L-lactate permease 
(lctP) [Helicobacter pylori
26695] 


5.0 


70 


AF013717


Homo sapiens 
periplakin (PPL) 
mRNA, partial cds 


5.0 


3719238


(AF064869) brain-enriched 
guanylate kinase-associated
protein 2; BEGA2 [Rattus
norvegicus] 


3.8


71 


X58245 


Soybean mRNA for 
HMG-1 like protein 


5.0 


2995363 


(AL022245) biotin synthase


0.99 


72 


AF102425


Frasera paniculata 
tRNA-Leu (trnL) 
gene, intron, 
chloroplast sequence 


4.9 


3522958 


(AC004411) putative 
pectinesterase [Arabidopsis 
thaliana] 


6.4 



73 


X82817 


H.sapiens 

PTPlOHCP- variant 
gene


4.9 


3875514 


(Z8I4y4j cUNA'hM" 
EMBL:D27474 comes from this 
gene; cDNA EST 
EMBL:D27473 comes from this 
gene; cDNA EST 
EMBL:T00471 comes from this
gene; cDNA EST
EMBL:D34192 comes from this
gene; cDNA EST
EMBL:D37241 comes from this
gene; ... 


2.8 


74 


U04827 


Mus musculus brain 
fatty acid-binding 
protein 


4.9 


3676132 


(AL031765) 1- 

evidence=predicted by content; 
1-method=genefinder;084; 1-
method_score=31.96; 1-
evidence_end; 2- 
evidence=predicted by match; 2- 
match_accession=SPTREMBL: 
Q93319;2- 

match_description=HYPOTHETICAL
PROTEIN C33A11.2.;...


2e-09 


75 


AF03SS59 


Neospora hughesi
strain NE 1 internal
transcribed spacer 1,
complete sequence 


4.8 


<NONE> 


<NONE> 


<NONE>



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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






M.musculus MFH-1










76 


Y08222 


gene 


4.8 


<NONE> 


<NONE> 


<NONE> 


77 


AJ224475 


Borrelia burgdorferi 
left chromosomal 
subtelomeric region 
(pfpB gene) 


4.8 


4218141 


(AJ236702) HMR1 protein 
[Antirrhinum majus]


8.3 


78 


U02486 


Mus musculus LAP 
putative membrane 
protein (KRAG) 
gene, exon 3 and 
complete cds 


4.8 


3258103 


(AP000006) 367aa long 
hypothetical protein 
[Pyrococcus horikoshii]


2.7 


79 


AB0002S0 


Rat mRNA for
peptide/histidine 
transporter, complete 
cds 


4.8 


806317 


(M29067) unknown protein 
[Saccharomyces cerevisiae] 


0.001 


80 


Z49771


A.cepa mitochondrial
gene for NADH 
dehydrogenase 
subunit 3 and 
ribosomal protein 
S12 


4.5 


<NONE> 


<NONE> 


<NONE> 


81 


M63494 


Mouse IgG receptor 
(beta-Fc-gamma-RII) 
gene, exons 6 and 7, 
clones lambda- 
Fc(3.2.93). 


4.3 


<NONE> 


<NONE> 


<NONE> 


82 


Z14035


S. pombe car1 gene


2.0 


3790665 


(AF099000) No definition line 
found [Caenorhabditis elegans]


1.2 


83 


U17129


Rhodococcus 
erythropolis ThcA 
(thcA) gene, complete 
cds; and unknown 
genes


2.0 


2828280 


(AL021687) putative protein
[Arabidopsis thaliana]
>gi|2832633|gnl|PID|e1249651
(AL021711) putative protein
[Arabidopsis thaliana] 


ze--o 


84 


AE001386 


Plasmodium 
falciparum 
chromosome 2, 
section 23 of 73 of 
the complete 
sequence 


2.0 


4176500 


(AL031177) dJ889M15.3 (novel
protein) 


9e-59 


85 


U79292 


Human clone 23734 
mRNA sequence 


1.9 


<NONE> 


<NONE> 


<NONE> 


86 


V00159 


Chloroplast Euglena 
gracilis gene coding
for the 5S and 16S 
rRNA. 


1.9 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Xenopus laevis XL- 










87 


U95094 


INCENP (XL- 
INCENP) mRNA, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


88 


X93206 


H.salinarium TATA 
box-binding protein 
genes and ORFs 


1.9 


<NONE> 


<NONE> 


<NONE> 


89 


U60979 


Caenorhabditis 
elegans programmed 
cell death specifier 
(ces-2) gene, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


90 


X56272 


C. tentans ORFs (A- 
E) for hemoglobin 


1.9 


<NONE> 


<NONE> 


<NONE> 


91 


L22383 


Homo sapiens DNA 
sequence, repeat 
region.


1.9 


<NONE> 


<NONE> 


<NONE> 


92 


U82814 


Hirudo medicinalis
neuron-specific
protein mRNA, 
complete cds 


1.9 


3822533 


(AF094531) immunoglobulin 
heavy chain precursor 


2.0 


93 


U18504


Haplomitrium 
hookeri 18S rRNA 
gene, partial 
sequence. 


1.9 


1083969 


hypothetical protein 6 - fowlpox 
virus virus] 


2.0 


94 


X53676 


Pseudomonas stutzeri 
nosDFY genes 
involved in copper 
processing 


1.9 


2980781 


(AL02219S) putative protein 


0.70 


95 


U60086 


Dictyostelium 
discoideum multidrug 
resistance 
transporter/Ser 
protease (tagC) 
mRNA, complete cds.


1.9 


3879530 


(Z49130) cDNA EST 
yk486b9.3 comes from this 
gene; cDNA EST yk486b9.5
comes from this gene 


6e-05 


96 


U33447 


Human putative G-
protein-coupled 
receptor (GPR17) 
gene, complete cds 


1.9 


3880034 


(Z75550) similar to cell division 
control protein [Caenorhabditis 
elegans] 


7e-14 


97 


M81327 


Sus scrofa lactoferrin 
mRNA, complete cds.
>gb|I28421|I28421
Sequence 5 from 
patent US 5571691 


1.8 


<NONE> 


<NONE> 


<NONE> 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






S.iniae lctP & lctO










98 


Y07622 


genes and ORF1 


1.8 


<NONE> 


<NONE> 


<NONE> 


99 


M60474 


Mouse myristoylated 
alanine-rich C-kinase 
substrate (MARCKS) 
mRNA, complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


100 


Y13901 


Homo sapiens FGFR- 
4 gene 


1.8 


<NONE> 


<NONE> 


<NONE> 


101 


U44400 


Human Down 
Syndrome region of 
chromosome 21, 
clone A3 1D6-1D6. 


1.8


<NONE> 


<NONE> 


<NONE> 


102 


U92808 


Ruminococcus albus 
beta-glucosidase 
(gluA) mRNA, 
complete cds 


1.8 


<NONE> 


<NONE> 


<NONE> 


103 


L25051 


Candida albicans 
argininosuccinate 
lyase (ARG4) gene, 
complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


104 


AE000546 


Helicobacter pylori 
26695 section 24 of 
134 of the complete 
genome 


1.8 


<NONE> 


<NONE> 


<NONE> 


105 


J00978 


Xenopus laevis major 
beta-globin gene,
complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


106 


U4I7I6 


Human 

immunodeficiency 
virus type 1 isolate 
JW95-5, vpr gene, 
complete cds. 


1.8 


<NONE> 


<NONE> 


<NONE> 


107 


X66286 


G.gallus mRNA for 
tensin


1.8 


<NONE> 


<NONE> 


<NONE> 


108 


U76636 


Xenopus calbindin 
D28k mRNA, 
complete cds 


1.8 


<NONE> 


<NONE> 


<NONE> 


109 


J00664 


rabbit embryonic beta 
4-globin gene.


1.8 


<NONE> 


<NONE> 


<NONE> 


110


M21535 


Human erg protein 
(ets-related gene) 
mRNA, complete cds.


1.8 


2983160 


(AE000693) hypothetical 
protein [Aquifex aeolicus] 


7.7 






WO 01/02568 



PCT/US00/18374 





Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Rat troponin T 






(Z46595) incomplete interleukin 




111


M80829 


cardiac isoform gene, 
complete cds 


1.8 


999450 


11 receptor isoform [Homo
sapiens] 


7.3 


112 


D37887 


Cyprinus carpio c- 
myc gene for c-Myc, 
complete cds 


1.8 



3023408 


BRANCHED-CHAIN AMINO 
ACID TRANSPORT SYSTEM 
CARRIER PROTEIN 
(BRANCHED-CHAIN AMINO 
ACID UPTAKE CARRIER) 
>gi|1075007|pir||D64056
membrane-associated
component, branched amino
acid transport system (brnQ)
homolog - Haemophilus
influenzae (strain Rd KW20) 
system II carrier protein (brnQ) 
[Haemophilus influenzae Rd] 


7.2 


113 


AF019765 


Homo sapiens G
protein-coupled
receptor kinase 1 and
G protein-coupled
receptor kinase 1b
(GRK1) gene,
alternatively spliced, 
alternative exon 6, 
exon 7, and partial 
cds 


1.8 


498643 


(U10270) G-box binding factor 
1 [Zea mays] 


7.2 


114 


AF025967 


Helicobacter pylori 
3166 virulence 
regulon 
transcriptional 
activator homolog
gene, partial cds, 
strain-specific 
genomic sequence B2 


1.8 


3850108 


(AL033388) putative calcium- 
transporting atpase 
[Schizosaccharomyces pombe] 


5.7 


115 


U13183 


Xenopus laevis 
(Xwm-4) mRNA, 
complete cds. 


1.8 




2494853 


PROBABLE
HYDROXYACYLGLUTATHIONE HYDROLASE
(GLYOXALASE II) (GLX II)
protein [Escherichia coli]
>gi|1786406 (AE000130)
probable
hydroxyacylglutathione
hydrolase [Escherichia coli]


5.5 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















116 


S68944 


Na+/Cl(-)-dependent 

neurotransmitter 

transporter 


1.8 


2276316 


(Z96810) GLYT-1 LIKE [Homo 
sapiens]


5.5 


117 


M92905 


Rat calcium channel 
alpha-1 subunit (rbB-
I) mRNA, complete 
cds. 


1.8 


3165522 


(AF067607) Similar to cuticular 
collagen; C18H7.3 


5.5 


118 


X12429


Xenopus laevis U1
70K gene exon 10 


1.8 


2735957 


(AF015685) reverse
transcriptase domain protein 


3.3 


119 


DS3333 


Mouse hepatitis virus 
genomic RNA for 
spike protein, partial 
cds 


1.8 


3876559 


XebiviIj jinuijjuy iu numim — 
cyclin A/CDK2-associated
protein P19 (RNA polymerase
elongation factor)
(SW:SKP1_HUMAN); cDNA
EST EMBL:T00114 comes
from this gene; cDNA EST
yk390f11.5 comes from this
gene; cDNA EST yk402e11.5
co...

>gi|3877216|gnl|PID|e1346850
protein P19 (RNA polymerase
elongation factor) gene; cDNA
EST yk390f11.5 comes from
this gene; cDNA EST
yk402e11.5 co...


3.3 


120 


AP0I6972 


Cervus elaphus 
REDDEER 
mitochondrial D- 
loop, complete 
sequence 


1.8 


3878057 


(Z99942) similar to von 
Willebrand factor type A 
domain; cDNA EST yk412d4.5 
comes from this gene; cDNA 
EST yk412d4.3 comes from this
gene 


3.2 


121 


AB010741 


Oncorhynchus mykiss 
mRNA for rtSox24, 
complete cds 


1.8 


1730805 


HYPUlHhI'HJALil.U KJJ 
PROTEIN IN RPS3-PSD1 
INTERGENIC REGION 
>gi|2132762|pir||S63129 
probable membrane protein 
YNL174w - yeast 
(Saccharomyces cerevisiae) 
>gi|1302152|gnl|PID|e239548 
(Z71451) ORF YNL174w
[Saccharomyces cerevisiae] 


2.5 


122 


U32844 


Haemophilus
influenzae Rd section 
159 of 163 of the 
complete genome 


1.8 


72S910 


A-TYPE INCLUSION 
PROTEIN (ATI) camelpox 
virus >gi|6238l (X69774) 
84kDa A-type inclusion protein
unidentified! 


1.9 
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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 



P VALUE 
















123 


U18321 


( lull 141 It i \s 1 1 1 (L 

radiation resistance 
conferring protein 
mRNA, complete cds.


1.8 


2133273 


ribosomal protein YS7 homolog 
Emericella nidulans 


1.4 


124 


M28668 


Human cystic fibrosis 
mRNA, encoding a
presumed 
transmembrane 
conductance regulator 
(CFTR).
>gb|I11500|I11500

Patent US 5407796 


1.8 


90492 


rill ttfrrt n rt f~* 1 1 f ct/~*\t* nn /~\ i tf a 

fUuggrin precursor - mouse 
(fragment) 


0.87 




AF064553 


Mus musculus NSD1 
protein mRNA,
complete cds 


1.8 


2501207 


PROBABLE PROTEIN 
DISULFIDE ISOMERASE P5 

(U40411) Similar to protein
disulfide-isomerase.
[Caenorhabditis elegans]


0.87 


126 




Human mRNA for
KIAA0316 gene,
complete cds


1 .0 


1 13 1 J 1 


R^GOLAtORY I^OTELN 

RftT A fTVRI^TT V \ PROTTTTsn 

£} rVJL* r\ \ U l\ 1 J I LL /*■ I I\.\J 1 £-11 * ) 

>gi|837l8|pir||A28913 
regulatory protein brlA - 
Emericella nidulans >gi|I68029 
vm.uojI) on a protein 




127 




Homo sapiens 
(subclone 10_d2 from 
P1H21)DNA 

;>CU 11CHLC 


I 8 

I .0 


A. 1 JJU-t 


metailoproteinase 1 {EC 3.4.24.- 

^ _ h 1 1 imn 
f tlUIJIUll 




128 


M37278 


R.norvegicus renin 
gene, exons 1-9. 


1.8 


40500S7 


(AF109907) SL64 [Homo
sapiens] 


0.58 


129 


X82879


Artificial sequences 
DNA for ART 2 
consensus 


1.8 


310929 


(L13442) cysteine-rich extensin-
like protein-4 [Nicotiana 
tabacum] 


0.52 


130 


D89729 


Homo sapiens mRNA
for CRM1 protein,
complete cds 


1.8 


3559944 


(AJ010792) Muc5AC protein 
[Mus musculus]


0.38 


131 


U7S076 


Mus musculus 
sepiapterin reductase 
gene, exons 1 and 2 


1.8 


2984225 


(AE000766) enolase- 
phosphatase E-1 [Aquifex
aeolicus]


0.095 
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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












COLLAGEN ALPHA 1(VIII)




132 


X52133 


Paramecium 168G 
gene for 168G 
surface protein 


1.8 


115316 


CHAIN PRECURSOR 

(ENDOTHELIAL 

COLLAGEN) 

>gi|105686|pir||S15435 collagen
alpha 1(VIII) chain precursor -


0.073 


133 


M77830 


Human desmoplakin I 
mRNA, complete cds. 


1.8 


1397246 


(Uol944) coded for by C.
elegans cDNA yk112f3.5; coded
for by C. elegans cDNA
cm21d2; coded for by C.
elegans cDNA CEESR07F;
coded for by C. elegans cDNA
yk112f3.3; coded for by C.
elegans cDNA CEESR29F 
[Caenorhabditis elegans] 


1e-04


134 


AJ224150 


Plasmodium berghei 
EF-1 alpha A- gene 


1.8 


1353761 


(U43192) myosin II heavy chain
[Naegleria fowleri] 


2e-05 


135 


AJ005518 


Mus musculus 
somatostatin receptor 
2 gene, exonl and 5' 
flanking region 


1.8 


1326350 


(U58748) similar to potential 
transmembrane domains in S. 
cerevisiae nuclear division
RFT1 protein (SP:P38206) 


2e-08 


136 


AF002217 


Ralstonia eutropha 
megaplasmid pHGI 
nitric oxide reductase 
(norB) gene, 
complete cds 


1.8 


3393018 


(AL031174) hypothetical 
protein 


2e-08 


137 


AF039035 


Caenorhabditis 
elegans cosmid 
C53A3 


1.8 


3850109


(AL033388) 3-oxoacyl-[acyl-
carrier-protein]-synthase 


3e-11


138 


M8I769 


S.domesticus 
immunoglobulin 
rearranged gamma 
chain mRNA, VJC 
region, complete cds. 


1.8 


30S0527 


(AL022600) putative mannose-1 
phosphate guanyl transferase
[Schizosaccharomyces pombe] 


3e-14


139 


Y11106


P.pastoris PYC1 gene


1.8 


1175412 


HYPOTHETICAL 24.2 KD 
PROTEIN C13A11.03 IN
CHROMOSOME I >gi|984224 
(Z54096) unknown 


1e-15


140 


US7803 


Human putative
Ca2+/calmodulin-
dependent protein
kinase kinase gene, 3'
flanking region,
partial sequence 


1.8 


2828280


(AL021687) putative protein

[Arabidopsis thaliana]
>gi|2832633|gnl|PID|e1249651
(AL021711) putative protein

[Arabidopsis thaliana]


3e-17 
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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Plasmodium 










141 


AE001430 


falciparum 
chromosome 2, 
section 67 of 73 of 
the complete 
sequence 


1.8 


1931647 


(U95973) endomembrane 
protein EMP70 precursor isolog


2e-20 


142 


L19708


Rat N-methyl-D- 
aspartate receptor 
(NMDAR1) gene, 
first exon


1.8


1731181


HVPO™micAL7i>.5Kl> w 
PROTEIN C14A4.3 IN 
CHROMOSOME II 
>gi|3874230|gnl|PID|e1351618
protein (Swiss Prot accession
number P38376); cDNA EST
yk220e10.5 comes from this


3e-2l 


143 


Y10728


P schwaf?i 
mitochondrial cytb 
gene, partial 


1.8 


3878644 


(Z81103) predicted using
Genefinder; cDNA EST
yk303g11.5 comes from this
gene; cDNA EST yk303g11.3
comes from this gene 
[Caenorhabditis elegans] 


Je-28 


144 


AB006631 


Homo sapiens mRNA 
for KIAA0293 gene, 
partial cds 


1.8 


4176500 


(AL031177) dJ889M15.3 (novel 
protein) 


7e-45 


145 


AF106967


Mus musculus 13 
protein mRNA, 
complete cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


146 


AE001073 


Archaeoglobus
fulgidus section 34 of
172 of the complete 
genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


147 


U12977


Pseudomonas
lemoignei poly (3- 
hydroxybutyrate) 
depolymerase A 
precursor (phaZ5) 
gene, complete cds, 
and glycero- 
phosphate - 
dehydrogenase 
homolog, complete 
cds. 


1.7 


<NONE> 


<NONE> 


<NONE> 


148 


M27038 


Mus musculus 
(SK/CamRk) 
germline IgK chain 
gene, J1-5 region.


1.7 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






H.sapiens HBF-1










149 


X74142


mRNA for 
transcription factor 


1.7 


<NONE> 


<NONE> 


<NONE> 


150 


U40830 


Streptococcus
thermophilus DeoD 
gene, partial cds and 
EpsA, EpsB, EpsC, 
EpsD, EpsE, EpsF, 
EpsG, EpsH, EpsI, 
EpsJ, EpsK, EpsL, 
EpsM,Orfl4.9 
protein genes, 
complete cds 


1.7


<NONE> 


<NONE> 


<NONE> 


151 


L29172 


Rabbit Ig germline 
gamma H-chain 
(allotype d12,e15) C-
region gene, 3' end.


1.7 


<NONE> 


<NONE> 


<NONE> 


152 


M19045


Human lysozyme 
mRNA, complete cds. 


1.7 


<NONE> 


<NONE> 


<NONE> 


153 


AE001159


Borrelia burgdorferi 
(section 45 of 70) of 
the complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


154 


L17027


Plasmid pFdA (from 
Fremyella 
diplosiphon) DNA 
sequence, including 
unidentified cds and 
stem loop. 


1.7 


<NONE>


<NONE> 


<NONE> 


155 


U12232 


Arabidopsis thaliana 
Columbia GTP 
binding protein beta 
subunit (AGB1) 
mRNA, complete cds.


1.7 


<NONE> 


<NONE> 


<NONE> 


156 


D42056 


Arabidopsis thaliana 
ATPK6 mRNA for 
ribosomal-protein S6
kinase homolog,
complete cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


157 


X9S117 


Rhizobium 

leguminosarum prsD, 
prsE, ORF3 genes 


1.7 


<NONE> 


<NONE> 


<NONE> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















158 


AF039084 


Spinacia oleracea 
heat shock 70 protein 
protein, complete cds 


1.7 


<NONE> 


<NONE> 


<NONE> 


159 


Z12651 


R.norvegicus gene for 
catechol 

methyltransferase


1.7 


<NONE> 


<NONE> 


<NONE> 


160 


AF002968 


Fringilla coelebs 
mitochondrial control 
region partial 
sequence 


1.7 


<NONE> 


<NONE> 


<NONE> 


161 


AE001160 


Borrelia burgdorferi 
(section 46 of 70) of 
the complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


162 


U67553 


Methanococcus 
jannaschii section 95 
of 150 of the 
complete genome 


1.7 


<NONE> 


<NONE> 


<NONE> 


163 


M86247 


S.ruminantium
plasmid pS23 DNA. 


1.7 


<NONE> 


<NONE> 


<NONE> 


164 


S74436 


oIL-8=interleukin-8
[sheep, spleen cells,
mRNA, 1435 nt]


1.7 


<NONE> 


<NONE> 


<NONE> 


165 


D12719 


Candida maltosa 
ALK7 (CYP52A10) 
and ALKS complete 
cds 


1.7


<NONE> 


<NONE> 


<NONE> 


166 


U02625 


Geotrichum 
candidum NRRL Y- 
553 lipase gene, 
partial cds. 


1.7 


321245 


230k bullous pemphigoid 
antisenBPMl - mouse 


9.3 


167 


Z58881 


H.sapiens CpG DNA, 
clone 114a4, reverse
read cpg114a4.rt1a.


1.7 


1854675 


(U66298) bone morphogenetic 
protein-6 [Rattus norvegicus] 


9.1 


168 


U43674 


Agrobacterium 
tumefaciens conjugal
transfer region 1
genes


1.7 


1352066 


LARGE PROLINE-RICH
PROTEIN BAT2 MHC class III
histocompatibility antigen HLA-
B-associated transcript 2 -
human >gi|179339 (M33509)
HLA-B-associated transcript 2
(BAT2) [Homo sapiens]
>gi|179345 (M33518) HLA-B-
associated transcript 2 (BAT2)
[Homo sapiens]


9.1 
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SEQ 
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



P VALUE 



169 AL023827



Caenorhabditis 
elegans cosmid 
Y12A6A, complete 
sequence 
[Caenorhabditis 
elegans] 



PROTOPORPHYRINOGEN



1.7 



731440 



OXIDASE (PPO) - yeast

(Saccharomyces cerevisiae) 
>gi|603606 (U18778) Hem14p:
protoporphyrinogen oxidase
[Saccharomyces cerevisiae]
>gi|1403536|gnl|PID|e249333
(Z71381) protoporphyrinogen
oxidase [Saccharomyces
cerevisiae]



8.9 



170 X69662



X.laevis mRNA for
glutathione
synthetase, large
subunit



1.7 



4038057 



(AC005897) hypothetical 
protein [Arabidopsis thaliana] 



8.8 



171 



Z35824 



S.cerevisiae
chromosome II 
reading frame ORF 
YBL063w 



1.7 



3021450 



(Y15515) prdl-a [Hydra
vulgaris] 



iga 

COLLAGEN ALPHA 1(XII)
CHAIN PRECURSOR 1(XII)
chain - chicken
>gi|222811|gnl|PID|d1001160
gallus]
>gi|2326442|gnl|PID|e39435
(X61024) collagen type XII
alpha 1 chain [Gallus gallus]



7.0 



172 M65139 



Cowpea chlorotic 
mottle virus (CCMV) 
1a protein gene,
complete cds. 



173 X15065



Drosophila distal BX 
C region (bithorax 
complex) pH189 5' 

esion; 



1.7 



2506307 



1.7 



1723625 



HYPOTHETICAL 10.0 KD
PROTEIN IN ALPA-GABD
INTERGENIC REGION (F87)
>gi|1033124 (U36840)
ORF_f87 [Escherichia coli]
>gi|1788982 (AE000348) orf,
hypothetical protein 



7.0 



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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















174 


Z46255 


S.cerevisiae 

rhrnmn^nmp VT 

lambda clone. 



1.7 



3875228 


(Z46792) similar to lethal(1)
discs large-1 tumor suppressor
protein-like repeats; cDNA EST
EMBL:D33495 comes from this 
gene; cDNA EST 
EMBL:D35117 comes from this 
gene; cDNA EST 
EMBL:D36356 comes from this 
gene; cDNA EST EMB...
>gi|3879984|gnl|PID|e1351767
suppressor protein-like repeats; 
cDNA EST EMBL:D33495 
comes from this gene; cDNA 
EST EMBL:D35117 comes
from this gene; cDNA EST
EMBL:D36356 comes from this

gene; cDNA EST EMB... 



6.7 


175 


U01066 


Human CD4 

nrnmnlpr nurtinl 

sequence. 


1.7


125448 


THYMIDINE KINASE 

^.Timtriinc hpm**<;vtnK 1 fotrnin 

JulUJJI Illl^i I J 1 \j Wu Til UJ 1 I JU UJ(1 

ll[Onc]) >gi|60341 


6.7 


176 




Phalaenopsis sp. 
'hybrid SM9108'
homeobox protein 

mRNA, complete cds


1.7


1022918 


(U38184) ATPase subunit 6 

[Trypanosoma cruzi]


6.7 


177 


U14662


Baboon herpesvirus 
HVP2 gB 

glycoprotein (UL27) 
gene, complete cds.


1.7 


3218378 


(AL023862) hypothetical 
protein SC3F9.07 [Streptomyces 
coelicolor] 


6.7 


178 


AB017006


Homo sapiens 
PMS2L15 mRNA, 
partial cds 


1.7 


1465855 


(U64859) glutamine-rich protein 
[Caenorhabditis elegans] 


6.7 


179 


U92651


Brassica oleracea var.
botrytis tonoplast
intrinsic protein
bobTIP26-1 mRNA,
complete cds 


1.7 


3023675 


DYNEIN HEAVY CHAIN, 
CYTOSOLIC (DYHC) dynein 
heavy chain 

[Schizosaccharomyces pombe] 


6.6 


180 


AF000634 


Lytechinus variegatus 
notch homolog 
mRNA, complete cds


1.7 


148574 


(M58520) endo-1,4-beta-
glucanase [Fibrobacter
succinogenes]


6.6 
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Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















181


M92354 


Arabidopsis thaliana 
anthranilate synthase 
alpha subunit gene, 
complete cds. 


1.7 


738308 


blue light photoreceptor 
[Arabidopsis thaliana]


6.D 


182


AJ234856


Hordeum vulgare 
genomic DNA 
fragment; clone 
MWG2234.rev


1.7 


3142302 


(AC002411) Strong similarity to
myosin heavy chain gb|234293 
from A. thaliana. [Arabidopsis 
thaliana] 


6.5 


183 


U76827 


Stercorarius 
parasiticus bird J33 
cytochrome b protein, 
partial cds 


1.7


3413810 


(Y17034) Bassoon [Mus 
musculus]


5.4 


184 


U05211 


Saccharomyces 
cerevisiae Ttplp 
(TTP1) gene, 
complete cds. 


1.7 


403173 


(L24492) lipoprotein 
[Rhodococcus erythropolis] 


4.9 


185 


AF076974 


Homo sapiens 
TRRAP protein 
(TRRAP) mRNA, 
complete cds 


1.7 


1170140 


PUTATIVE 

ENDOGLUCANASE TYPE K 
PRECURSOR (ENDO-1,4-
BETA-GLUCANASE)
(CELLULASE) 


4.1 


186 


AE000753 


Aquifex aeolicus
section 85 of 109 of 
the complete genome 


1.7 


1169357


DNA ADENINE METHYLASE 
site-specific DNA- 
methyltransferase (adenine- 
specific) dam methylase gene 
product [Vibrio cholerae] 


4.0 


187 


AF005638


Tupaia glis 
apolipoprotein AI 
prepropeptide 
mRNA, complete cds 


1.7 


3355682 


(AL031124) putative secreted
lyase 


4.0 


188 


M23090 


Human germline IgK 
chain gene V3-region, 
clone Humkv328h5 


1.7 


2257483 


(AB004534) pi003 
[Schizosaccharomyces pombe] 


4.0 


189 


M24001 


Mink enteritis virus 
antigenic type 2 
capsid protein genes 
VP1 and VP2, 
complete cds. 


1.7 


2143504 


myotonic dystrophy kinase - 
mouse (fragment) kinase, DM- 
kinase {C-terminal, alternatively 
spliced, clone delta IUIIJV.V J 
[mice, brain, Peptide Partial, 
474 aa] [Mus sp.] 


3.9 


190 


X59964 


H. sapiens CST4 gene 
for Cystatin D 


1.7 


1766075 


(U37273) winged helix protein 
CWH-2 [Gallus gallus]


3.1


191


X95276


P. falciparum
complete gene map of
plastid-like DNA (IR-
B)


1.7


3219951


HYPOTHETICAL 11.7 KD
PROTEIN C6B12.13 IN
CHROMOSOME I
>gi|2330843|gnl|PID|e334047
pombe] 


3.0 


192 


D84487 


Rat PMSG-induced
ovarian mRNA, 
3'sequence, N10 


1.7 


173164 


. 

(J02719) valyl-tRNA synthetase 
[Saccharomyces cerevisiae] 


2.3 


193 


L14851


Rattus norvegicus 
neurexin III-alpha
gene, complete cds. 


1.7 


3323586 


(AF060869) single-strand 
binding protein [Salmonella 
typhimurium] 


2.3 


194 


M97002 


Xenopus laevis/gilli 
hybrid pseudo-IgH 
chain gene, V region, 
clone LG7G342A. 


1.7 


2118407 


MHC sex-limited protein -
mouse (fragment) musculus] 


2.3 


195 


L07025 


Bacillus thuringiensis
delta-endotoxin
(Cry A (a)) gene, 5'
end. > ::

gb|I34520|I34520
Sequence 1 from
patent US 5596071 >

gb|I39790|I39790
Sequence 1 from
patent US 5616495 >

gb|AR008487|AR008
487 Sequence 1 from
patent US 5753492


1.7 


2496940 


HYPOTHETICAL 53.4 KD 
PROTEIN D1054.13 IN
CHROMOSOME V
>gi|3875316|gnl|PID|e1344967


1.8


196 


S73149 


insulin-like growth
factor II (intron 7)
[human, Genomic, 
1702 nt] 


1.7 


3327038 


(AB014512) KIAA0612 protein
[Homo sapiens] 


1.8 


197 


D86990 


Human (lambda) 
DNA for 

immunoglobulin light 
chain 


1.7 


494367 


Fv Fragment (Murine Se 155-4) 
Complex With The 
Trisaccharide: Alpha-D- 
Galactose(1-2)[alpha-D-
Abequose(1-3)]alpha-D-
Mannose (P1-Ome) (Part Of
The Cell-Surface Carbohydrate 
Of Pathogenic Salmonella) 


1.8


198 



L17027



Plasmid pFdA (from
Fremyella
diplosiphon) DNA
sequence, including
unidentified cds and
stem loop.



1.7 



1082702 



poliovirus receptor-related
protein - human 



1.4 



200 



201 



202 



203 



205 



206 



207 



208 



U89926 



Caenorhabditis 
elegans cosmid 
H22D14, complete 
sequence 
[Caenorhabditis
elegans]

Drosophila
melanogaster cut 
gene, partial sequence 



3924605 



1.7 



2245100 



(AF069442) putative inhibitor 
of apoptosis [Arabidopsis 
thaliana] 



(Z97343) DNA-binding protein 
homolog



Z25749 



U59841 



H.sapiens gene for 
ribosomal protein S7 
Fundulus heteroclitus 
lactate dehydrogenase 
B 



1.7 



2493459 



3005587 



PROTEIN KINASE C 
SUBSTRATE, 60.1 KD 
PROTEIN, HEAVY CHAIN 
(PKCSH) (80K-H PROTEIN) 

>gi] 12 15746 

(AF048977) Ser/Arg-related 
nuclear matrix protein [Homo 
sapiens] 



X55763 



Rabbit mRNA for 
smooth muscle 
calcium channel 
blocker (CaCB) 
receptor



1.7 



3883128 



U50912 



Caenorhabditis 
elegans cosmid 
C18B 12 A. complete 

sequence 
[Caenorhabditis 

elegans] 
Human XIST gene, 

polypurine-
pyrimidine repeat
region



940397 



1.7 



2338027 



X12817 



Ovis aries beta- 
lactoglobulin gene 



1.7 



987050 



AF004419 



Homo sapiens 
troponin T (TNNT2) 
gene, exon 13



1.7 



2996364 



L43643 



Gallus domesticus 
DNA microsatellite
marker MCW119 



1.7 



464S96 



(AF082302) arabinogalactan- 
protein [Arabidopsis thaliana] 



(D10123) core [Hepatitis C
virus] 

(AF005370) large tegument 
protein [Alcelaphine herpesvirus 

1] 



(X65335) lacZ gene product 
[unidentified cloning vector] 



(AF053947) unknown [Yersinia 
pestis] >gi|3S83090 



TRANSDUCIN-LIKE
ENHANCER PROTEIN 1
enhancer-of-split homolog TLE
1 - human >gi|307510



3b 



1.4 



1.3 



0.82



0.82 



0.80



0.59 



0.45 



0.2: 



0.20


209 


Z73278


S.cerevisiae
chromosome XII 
reading frame ORF 
YLR106C 


1.7 


1351657 


PROTEIN C30D11.04C IN
CHROMOSOME I 
>gi|2l304U|pir||S62562 
hypothetical protein 
SPAC30D 1 1 .4c - fission yeast 
nuclear pore complex protein 
[Schizosaccharomyces pombe]


0.20 


210 


M22345 


Mouse endogenous 
provirus gag, pol, and
env region DNA. 


1.7 


2444455 


(AF020765) hypothetical 
protein [Myxococcus xanthus] 


0.12 




AE000360


Escherichia coli K-12
MG1655 section 250 
of 400 of the 
complete genome 


1.7 


2736361 


(AF039038) No definition line 
found [Caenorhabditis elegans] 


0.12 


212 


AB020692 


Homo sapiens mRNA 
for KIAA0885
protein, complete cds 


1.7 


2605924 


(AF029726) histidine kinase C 
[Dictyostelium discoideum] 


0.094 


213 


S69429 


testis-determining
gene/SRY homolog
[Sminthopsis
macroura=striped-
faced dunnarts,
Genomic, 855 nt]


1.7 


2499016 


TONB PROTEIN >gi|1666536
(U23764) TonB [Pseudomonas 
aeruginosa] 


0.092 


214 


S69429 


testis-determining 
gene/SRY homolog 

[Sminthopsis
macroura=striped-
faced dunnarts,
Genomic, 855 nt]


1.7 


2499016 


TONB PROTEIN >gi|1666536 
(U23764) TonB [Pseudomonas 
aeruginosa] 


0.088 


215 


U67205 


Mus musculus ACF7
neural isoform 3

partial cds 


1.7 


2047349 


(ArUUUl yo; weak similarity to 
HSP90 [Caenorhabditis elegans]


0.052 


216 


X98188 


Artificial DNA 
sequence for 
mammalian lambda- 
neo minichromosome, 
1400 bp 


1.7 


2493779 


PUTATIVE CUTICLE
COLLAGEN C09G5.6 
collagen; cDNA EST yk244c3.5 
comes from this gene; cDNA 
EST yk244c3.3 comes from this 
gene [Caenorhabditis elegans]


0.042 


217 


U70139 


Mus musculus 
putative CCR4 
protein mRNA,
partial cds 


1.7 


2252630 


(U95973) hypothetical protein 
[Arabidopsis thaliana] 


0.041 



218 


L38808 


Homo sapiens alpha- 
1 type V collagen 
(COL5A1) gene, 5'
flank and exon 1. 


1.7 


2895760 


(AF045246) universal minicircle 
sequence binding protein 
minicircle sequence binding 
protein [Crithidia fasciculata] 


0.039 


219 


Z72151


B.napus mRNA for 
AMP-binding protein


1.7 


190475 


(K02576) salivary proline-rich
protein 1 [Homo sapiens] 


0.011 


220 


X94152 


R.norvegicus mRNA 
for cysteine sulfinate 
decarboxylase 


1.7 


2136212 


synapsin IIb - human
>gi|1594277 (U40215) synapsin
IIb [Homo sapiens]


0.008 


221 


L20255 


Mouse stathmin gene 
sequence. 


1.7 


2317934 


(U97553) unknown [murine 
herpesvirus 68]


0.006 


222 


L13600


Rattus norvegicus 
glycine transporter 
mRNA, complete cds. 


1.7 


726403 


(U23175) similar to anion
exchange protein 
[Caenorhabditis elegans] 


0.003


223 


AJ224150 


Plasmodium berghei 
EF-lalpha A-gene 


1.7 


2072290 


(U95094) XL-INCENP
[Xenopus laevis] 


0.001 


224 


S80642


butyrophilin [mice, 
lactating mammary 
gland , mRNA Partial, 
3193 nt]


1.7 


2695746 


(AJ223010) Pmt2 
[Schizosaccharomyces pombe]


9e-04 


225 


M22363 


C.elegans unc-86
gene encoding two 
alternative proteins, 
complete cds. 


1.7 


2224683 


(AB002369) KIAA0371 [Homo 
sapiens]


1e-04


226 


X92123 


M.musculus cgt gene 
exon 1 


1.7 


3874232 


(Z49909) similar to Prokaryotic 
ribonuclease PH
[Caenorhabditis elegans] 


3e-05


227 


AB016000


Ipomoea nil PKn2 
(knotted-like gene) 
mRNA, complete cds 


1.7 


2183083 


(AF000422) '1TM interacting 
peptide 5 [Homo sapiens] 


1e-05


228 


D14133 


Bovine mRNA for 
synaptocanalin 1 


1.7 


392527.7 


(ALU32C)43J similar to 
Uncharacterized protein family
UPF0034, Double-stranded 
RNA binding motif; cDNA EST 
yk489b3 5 comes from this 
gene; cDNA EST yk439g7.5 
comes from this gene 
[Caenorhabditis elegans] 


2e-06


229 


L01991


Mus musculus TAFG- 
1 -like neuronal 
glycoprotein (PCS) 
mRNA, complete cds. 


1.7 


3006139 


(AL022299) hypothetical 
protein 


4e-07 


230 


X63016 


Tomato yellow leaf 
curl virus Thailand 
isolate complete 
genome (TYLCV-TH 
B-DNA) 


1.7 


3643608 


(AC005395) hypothetical 
protein [Arabidopsis thaliana] 


1e-07


231 


Z22802 


H.sapiens 

microsatellite repeat. 

> :: 

gb|G34562|G34562 
human STS SHGC- 
51834 


1.7 


100210 


extensin precursor (clone Tom L 
4) - tomato esculentum] 


4e-09 


232 


K02765 


Human complement 
component C3 
mRNA, alpha and 
beta subunits, 
complete cds. 


1.7 


2984320 


(AE000773) acetoin utilization 
protein [Aquifex aeolicus] 


1e-09


233 


Z74818


S.cerevisiae
chromosome XV 
reading frame ORF 
YOL076w 


1.7 


3873700 


(Z/3102) predicted using
Genefinder; Similarity to
Bacillus subtilis DNAJ protein
gene; cDNA EST
EMBL:C12520 comes from this
gene; cDNA EST
EMBL:D71409 comes from this
ge...


7e-11


234 


D21871 


Pig mRNA for thimet 
oligopeptidase


1.7 


2632098 


(Y15513) Prodos protein
[Drosophila melanogaster] 


8e-13


235 


Y14344


Gallus gallus gene 
encoding neurofascin, 
exons 9, 10, 11 & 12


1.7 


3876421 


(Z81070) cDNA EST
EMBL:C12730 comes from this
gene; cDNA EST yk200b6.5
comes from this gene; cDNA
EST yk349a12.5 comes from
this gene [Caenorhabditis
elegans]


3e-14


236 


Z73608 


S.cerevisiae 
chromosome XVI 
reading frame ORF 
YPL252c 


1.7 


1439663 


(U64605) C05D9.6 gene 
product [Caenorhabditis 
elegans] 


6e-18



237


AG000518


Homo sapiens
genomic DNA, 21q
region, clone:
T171N23


1.7


1174468


OLIGOSACCHARYL
TRANSFERASE STT3
SUBUNIT HOMOLOG
>gi|529357 (U13019) No
definition line found 
[Caenorhabditis elegans] 


6e-18


238 


D17716


Human mRNA for N-
acetylglucosaminyltransferase
V, complete
cds 


1.7 


961446 


(D63877) KIAA0157 gene
product is novel. 


5e-19


239 


AF102512 


Cheilodactylus
vittatus country USA:
Midway Island
cytochrome c oxidase
subunit I gene,
mitochondrial gene 
encoding 
mitochondrial 
protein, partial cds 


- 

1.7 


1572756 


(U70848) C43G2.1 gene
product [Caenorhabditis 
elegans] 


5e-40 


240 


L30107 


Rattus norvegicus 
liver-specific 
transporter gene,
promoter region. 


1.7 


4176443 


(AL 0222 38) dJ]042KJ0 4 
(novel protein) 


3e-49 


241 


X91220 


H.sapiens mRNA for
Na-Cl electroneutral
thiazide-sensitive 
cotransporter 


1.7 


3478637 


(AC005546) R29425.1 [Homo 
sapiens] 


6e-54 


242 


U97146


Rattus norvegicus 
calcium-independent
phospholipase A2 
mRNA, complete cds 


1.6 


<NONE> 


<NONE> 


<NONE> 


243 


Z43508 


Pea seed-borne 
mosaic virus RNA for 
coat protein and 
polymerase (partial) 


1.6 


<NONE> 


<NONE> 


<NONE> 


244 


MIS349 


Rat leukocyte 
common antigen (L- 
CA) gene, exons 1 
through 5.


1.6 


<NONE> 


<NONE> 


<NONE> 


245 


M13158 


Yeast (S.pombe) 
cdc25+ gene (mitosis 
initiation), complete 
cds. 


1.6


<NONE>


<NONE>


<NONE>



247 
248 



251 



252 



246 I U39712 



Mycoplasma 



genitalium section 34 
of 51 of the complete
genome 



M17922
M89986 



249 I M89986 



250 U68725 



X95199 



Mouse Murine 
urokinase-type
plasminogen activator 
protein gene, 
complete cds. 
Human polymorphic 
loci in Xq28 



Human polymorphic 
loci in Xq28. 



Rattus norvegicus 
Deleted in colorectal 
Cancer 



P.platessa GSTA. 
GSTA1, GSTA2, and 
PPTN genes



Y09103 



253 I Z14078



254 I AB002314 



255 I M21488 



1.6 



1.6 
1.6 



1.6 



1.6 



1.6 



D.melanogaster
RPA1 gene



T.aestivum 
mitochondrion fMet, 
18S, 5S repeat unit 
DNA 



Human mRNA for 
KIAA0316 gene,
complete cds 



Human muscle 
creatine kinase gene 
(CKMM), exon 2.



1.6 



1.6 



1.6 



1.6 



2143805 



inositol-polyphosphate 4- 
phosphatase - rat 



1256804 



(U51449) RING3 protein
[Xenopus laevis] 



3915113 



3916021



2501668 



DYSTROPHIN- RELATED 
PROTEIN 2 sapiens] 



<NONE> | <NONE> | <NONE> 

(ZM4y9) predicted using 
Genefinder; cDNA EST 
yk410e3.3 comes from this 
gene; cDNA EST yk410e3.5 
comes from this gene 
3875750 [Caenorhabditis elegans] | 8.0

(Z84724) psd [Mycobacterium 
3261710 tuberculosis] | 6.4



130997 



REPETITIVE PROLINE-RICH 
CELL WALL PROTEIN 1 
PRECURSOR 

>gi|8!809|pir||A29324 proline- 
rich protein precursor - soybean 
>gi[ 170049 (J02746) proline- 
rich protein [Glycine max] 



119399 



ENV POLYPROTEIN
PRECURSOR (COAT
POLYPROTEIN) [CONTAINS:
COAT PROTEIN GP62; COAT
PROTEIN GP40]



6.2 



5.8 



MALEYLACETATE
REDUCTASE Pseudomonas
cepacia >gi|643636 (U19883)
maleylacetate reductase 

[Burkholderia cepacia] | 4.9 

HYPOTHETICAL $1 KD 
PROTEIN IN COB INTRON
>gi|2654230|gnl|PID|e1192341
(X02819) unidentified reading
frame [Schizosaccharomyces
pombe] | 4.8



3.6 



2.8 



2.2


256 


AE001164


Borrelia burgdorferi 
(section 50 of 70) of 
the complete genome 


1.6 


4050089 


(AF109907) hypothetical
protein [Homo sapiens] 


1.5 


257 


X61757 


M.musculus 
rearranged T-cell
receptor beta variable 
region (Vbl7a) 


1.6 


3377766 


(AF080090) semaphorin IV 
isoform b [Mus musculus] 


1.2 


258 


M15346


T.cruzi tandemly 
repeated gene 
encoding an 85 kDa 
antigen with 
homology to heat 
shock proteins. 


1.6 


2804437 


(AF043695) similar to zinc 
metalloprotease family of
peptidases [Caenorhabditis 
elegans] 


0.41 


259 


L39018


Rattus norvegicus 
sodium channel 
protein 6 (SCP6) 
mRNA. complete cds 


1.6 


2920535 


(AF018081) type XVIII
collagen [Homo sapiens] 


0.037 


260 


M29483 


Human leukocyte 
adhesion protein 
p150,95 alpha subunit
gene, exons 7 - 15. 


1.6 


1840045 


(U49082) transporter protein 
[Homo sapiens] 


2e-09 


261 


L06844 


Aspergillus niger beta 
D-fructofuranosidase 
(suc1) gene, one
exon. 


1.6 


4206210 


(AF071527) putative calcium 
channel [Arabidopsis thaliana]


9e-10 


262 


M10946


Chicken aldolase B 
gene, complete cds, 
clones lambda- 
C(U. 1.4). 


1.6 


2746775 


(AF040640) similar to peptidase 
family C19 (ubiquitin carboxyl- 
terminal hydrolase) 
[Caenorhabditis elegans]


1e-31


263 


X07881 


Human gene PRB3L
for proline-rich
protein G1


1.5 


<NONE> 


<NONE> 


<NONE> 


264 


U22260 


Nicotiana tabacum
UMP synthase (pyr5-
6) mRNA, partial cds


1.5 


3880923 


(Z99271) similar to Reverse 
transcriptase comes from this 
gene [Caenorhabditis elegans] 


0.50 


265


U76759 


Mus musculus
nuclear protein
NIP45 mRNA,
complete cds 


1.4 


1330394 


(U58761) C01F1.6 gene product
[Caenorhabditis elegans] 


8.9


266


AF076470


Rice tungro
bacilliform virus
Serdang strain,
complete genome


1.4


1703461


POTASSIUM-
TRANSPORTING ATPASE
BETA CHAIN (PROTON 
PUMP) (GASTRIC H+/K+ 
ATPASE BETA SUBUNIT) 
3.6.1.36) beta chain - human
>gi|184105 (M75110) H,K-
ATPase beta subunit [Homo 
sapiens] 


8.9 


267 


X64659 


C.jacchus interferon
gene for interferon 
gamma 


1.4 


1486485


(U28832) US10 [Gallid
herpesvirus 1] >gi|1486497


6.8 


268 


U11825 


Schistosoma 
japonicum structural 
muscle protein 
paramyosin mRNA, 
complete cds. 


0.88 


<NONE> 


<NONE> 


<NONE> 


269 


D84278 


Human DNA for 
CD38, exon 1


0.68 


3766363 


(AL031907) hypothetical serine 
rich protein 

[Schizosaccharomyces pombe] 


3.0 


270 


M59755 


Bovine lens aldose 
reductase 

pseudogene, 3' end.


0.67 


<NONE> 


<NONE> 


<NONE> 


271 


M81758


Homo sapiens 
skeletal muscle 
voltage-dependent
sodium channel alpha
subunit (SkM1)
mRNA, complete cds.


0.65 


2437819 


(Z86105) 1,4-beta-glucanase
[Anaerocellum thermophilum]


3.6 


272 


L01965 


Human type IV
sodium channel alpha 
polypeptide 


0.64 


2437819 


(Z86105) 1,4-beta-glucanase
[Anaerocellum thermophilum]


3.5 


273 


U90122


Danio rerio bone
morphogenetic 
protein-4 (bmp4) 
mRNA, partial cds 


0.63 


2983532 


(AE000720) formate 
dehydrogenase alpha subunit 
[Aquifex aeolicus]


7.9 


274 


L41624 


Hylobates lar mucin
(MUC1) gene, exons
1-6. 


0.63 


1517808


(D79215) FGF-10 [Rattus 
norvegicus]


0.91 



275


AF030881


Fugu rubripes sushi
retrotransposon gag
polyprotein (gag) and
pol polyprotein (pol)
genes, complete cds


0.63


1519696


(U67956) coded for by C.
elegans cDNA yk126f9.5; coded
for by C. elegans cDNA
yk159h6.3; coded for by C.
elegans cDNA yk126f9.3; coded
for by C. elegans cDNA
yk159h6.5 [Caenorhabditis
elegans] 


0.38


276 


U52909 


Arabidopsis thaliana 
U1 snRNP 70K
protein gene, 
complete cds 


0.62 


<NONE> 


<NONE> 


<NONE> 


277 


AF008192 


Homo sapiens 
putative GR6 protein 
(GR6) mRNA,
complete cds 


0.62 


3800934 


(AF100655) contains similarity 
to ser/thr protein kinases 
[Caenorhabditis elegans] 


9.7 


278 


U17081 


Human fatty acid 
binding protein 
(FABP3) gene, 
complete cds 


0.62 


3617848 


(AF049709) tyrosylprotein 
sulfotransferase-A; TPST-A 


7.7 


279 


AB018340 


Homo sapiens mRNA 
for KIAA0797 
protein, partial cds 


0.62 


424044 


VP5 protein - porcine rotavirus 
>-i|61355 


7.7 


280 


Y00093


H.sapiens mRNA for 
leukocyte adhesion 
glycoprotein p150,95


0.62 


1054945 


(U38621) polyprotein [Tobacco 
vein mottling virus]


4.5 


281 


M63138


Human cathepsin D 
(catD) gene, exons 7, 
8, and 9. 


0.62 


136810 


GLYCOPROTEIN M 
>gi|73791|pir||WMBE51 UL10 
protein - human herpesvirus 1 1- 
473) [Human herpesvirus 1 ] 
>gi|22 1732|gnl|PID|dl002 1 3 1 


3.5 


282 


X76056 


N.sylvestris DNA for
spacer region
between 25S and 18S
ribosomal RNA genes 


0.62 


2661176


(U76671) putative cds 
[Rhodobacter sphaeroides]


2.0 


283 


X74501 


B.taurus mRNA for 
ACTH receptor 


0.62 


4249552 


(AB001075) galectin-2 related 
protein 


2.0 


284 


M57634 


Rat F1-ATPase beta
subunit mRNA, 3'
end. 


0.62 


2119692 


transforming growth factor-beta
type III receptor - chicken 
>gi|51 184? (L01121) 
transforming growth factor-beta 
type III receptor [Gallus gallus] 


1.5


285


Y15724


Homo sapiens
SERCA3 gene, exons
1-7 (and joined CDS)


0.62


2498164


ASPARTYL/ASPARAGINYL
BETA-HYDROXYLASE
(ASPARTATE BETA- 
HYDROXYLASE) (ASP BETA 
HYDROXYLASE) (PEPTIDE- 
ASPARTATE BETA- 
DIOXYGENASE) beta- 
dioxygenase (EC 1.14.11.16)- 
bovine >gi|162694 taurus]


0.52 


286 


AL010142 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 3-72, 
complete sequence 


0.62 


3183206 


HYPOTHETICAL PROTEIN 
KIAA0009 sapiens] 


4e-07 


287 


AB008160


Mus musculus Stat3 
gene, 5'-flanking
region and exon 1 
partial sequence 


0.62 


466097 


HYPOTRETiC'AL 6 Jo Kl> 
PROTEIN ZK353.1 IN
CHROMOSOME III 
>gi|1078903|pir||S44654 
ZK353.1 protein - 
Caenorhabditis elegans
>gi|289757 (L15313) putative
[Caenorhabditis elegans] 


1e-35


288 


AB018795


Halomonas marina
gene for alginate 
lyase, complete cds 


0.62 


3877493 


(Z483Sj ) similar to ATPases
associated with various cellular
activities (AAA); cDNA EST
EMBL:Z14623 comes from this
gene; cDNA EST
EMBL:D75090 comes from this
gene; cDNA EST
EMBL:D72255 comes from this
gene; cDNA EST yk200e4... 


3e-46 


289 


Z69906 


Human DNA 
sequence from 
cosmid E141E2, on 
chromosome 22, 
complete sequence 
[Homo sapiens]


0.61 


<NONE> 


<NONE> 


<NONE> 


290 


U18259


Human clone CIITA-
8 MHC class II
transactivator CIITA
mRNA, complete cds. 


0.61 


1483567 


(X79983) viral proteinase 
[Pseudorabies virus]


9.8 


291 


X98890 


S. tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPT1


0.61 


475724 


(U0SS84) protein VIII precursor 
[Bovine adenovirus type 3]


7.6 



292 


U70825 


Rattus norvegicus 5- 
oxo-L-prolinase
mRNA, complete cds 


0.61 


733543 


(U23448) similar to genome 
polyprotein 

(SP:POLG_BVDVN, P197U); 
alternative splicing to C04A2.7a 


4.4 


293 


L81667


Homo sapiens 
(subclone 2_a9 from 
PI H49) DNA 
sequence 


0.61 


2565087 


(U80759) CAGH4 alternate 
open reading frame [Homo 
sapiens] 


3.3 


294 


AE000760 


Aquifex aeolicus 
section 92 of 109 of 
the complete genome 


0.61 


2811092 


HOMEOBOX PROTEIN HOX- 
A3 (HOX-1.5) homeobox-
containing transcription factor 
[Mus musculus] 


2.6 


295 


U585P 


Mus musculus Rho- 
associated, coiled- 
coil forming protein 
kinase p160 ROCK-1
mRNA, complete cds


0.61 


295671 


(L11275) selected as a weak
suppressor of a mutant of the 
subunit AC40 of DNA 
dependant RNA polymerase I 
and III 


1.5 


296 


U27459 


Human origin 
recognition complex 
protein 2 homolog 
hORC2L mRNA, 
complete cds 


0.61 


200285 


(M97900) putative open reading 
frame [Mus musculus] 


0.66 


297 


L36680 


Pisum sativum S- 
adenosylmethionine 
synthase mRNA, 3'
end. 


0.61 


2285790 


(AB002086) p47 [Rattus 
norvegicus] 


4e-12


298 


AE000673


Aquifex aeolicus 
section 5 of 109 of 
the complete genome 


0.61 


3395782 


(AF058446) histone 
macroH2A1.2 [Gallus gallus]


6e-27 


299 


AF086310 


Homo sapiens full 
length insert cDNA 
clone ZD51F08 


0.61 


3646450 


(AL031603) conserved 
hypothetical protein. 
[Schizosaccharomyces pombe]


8e-29 


300 


AJ009675 


Agrotis ipsilon
mRNA for 3-hydroxy-
3-methylglutaryl
coenzyme A 
reductase 


0.61 


4176370 


(AC005058) similar to calcium- 
independent phospholipase A2; 
similar to AC004392 
(PID:g3367519) [Homo
sapiens] 


2e-73 


301 


AC005577 


Homo sapiens 
chromosome 19. 
cosmidFIS3S2B, 
centromeric end, 
complete sequence 
[Homo sapiens] 


0.60


<NONE>


<NONE>


<NONE>


302


U40454


Candida albicans
topoisomerase type I
(CATOPI) gene, 
complete cds 


0.60


<NONE> 


<NONE> 


<NONE> 


303 


J01390


Emericella nidulans
mtDNA between
h2/h5 and br^2
junctions, genes for
ATPase subunit 6,
cytochrome oxidase
subunit 3, seven
unidentified proteins,
twentyfour tRNA's
and L-rRNA.


0.60 


<NONE> 


<NONE> 


<NONE> 


304 


L11172


Plasmodium
falciparum RNA
polymerase I gene, 
complete cds. 


0.60 


<NONE> 


<NONE> 


<NONE> 


305 


Z81079 


Caenorhabditis 
elegans cosmid
rjyii 1 1 , complete 
sequence 
[Caenorhabditis 
elegans] 


0.60 


<NONE> 


<NONE> 


<NONE> 


306 


Z49627


S.cerevisiae
chromosome X
reading frame ORF
YJR127c


0.60 


118751


MAJOR DNA-BINDING
PROTEIN herpesvirus I (strain
11) >gi|60327 (X64346) major
ssDNA-binding protein
[Saimiriine herpesvirus 2]


9.6 


307 


U94911 


Rattus norvegicus H- 
K-ATPase alpha 2 
gene, alternatively 
spliced products and
partial cds 


0.60 


2213862 


{nrKfijjVoO) rlovircL, 

[Plasmodium falciparum] 


7.4 


308 


U67476 


Methanococcus 
jannaschii section 18 
of 150 of the 
complete genome 


0.60 


1749688 


(D89240) unnamed protein 
product 


5.7 


309 


U67513


Methanococcus 
jannaschii section 55 
of 150 of the 
complete genome 


0.60 


3327421 


(U97068) zonadhesin [Mus 
musculus] 


4.3 


310 


U57817 


Haemophilus ducreyi 
lipoprotein gene, 
complete cds 


0.60 


4008577 


(AL034491) conserved 
hypothetical protein 
[Schizosaccharomyces pombe] 


2.5


311


X80700


H.sapiens G17 gene


0.60


422541


probable protein-tyrosine kinase
(EC 2.7.1.112) RTK - Pacific
electric ray >gi|290858 


1.5 


312 


I 42 1 67 


Mus musculus (clone 
R24) rds gene, partial 


0.60 


4220843 


(AF033823) moira [Drosophila 
melanogaster]


0.51 


313 


U54777 


Human hMSH6 
mRNA, complete cds 


0.60 


2665637 


(AF031087) mismatch repair 
protein MSH6 [Mus musculus] 


5e-07 


314 


D86985 


Human mRNA for
KIAA0232 gene, 
complete cds 


0.60 


1938462 


(U97006) No definition line 
found [Caenorhabditis elegans]


2e-07 


315 


D43964 


Rat liver mRNA for 
Kan- 1, complete cds 


0.60 


1280135 


coded for by C.
elegans cDNA cm21e6; coded 
for by C. elegans cDNA 
cm01e2; similar to melibiose 
carrier protein 

(thiomethylgalactoside permease 
11) 


5e-15 


316 


U49058 


Rattus norvegicus
CTD-binding SR-like
protein rA4 mRNA,
partial cds 


0.60 


2145091 


(U37500) RNA polymerase II 
largest subunit [Mus musculus] 


1e-19


317 


X84388 


U.ruddi 

mi rrwhnnflri 1 1 lVS 
fill \Aj\- I IUI IvJI I Ul 1 t-<-J 

ribosomal RNA 


0.60 


3874247 


YZ70°05) predicted using 
Genefinder 


2e-37 


318 


AF125447


Caenorhabditis
elegans cosmid
Y14H12B


0.59 


<NONE> 


<NONE> 


<NONE> 




319 


U20189 


Hyoscyamus muticus
clone cVS2 
vetispiradiene 
synthase mRNA, 
partial cds. 


0.59 


<NONE> 


<NONE> 


<NONE> 


320 


M63962 


Human gastric H,K- 
ATPase catalytic 
subunit gene, 
complete cds. 


0.59


<NONE>


<NONE>


<NONE>


321 


AJ132366


Helicobacter pylori 
(strain P1) comB and
pmi/algA (partial)
genes, and partial
ORF1 and ORF2


0.59


<NONE>


<NONE>


<NONE>


322


U17289


Mus musculus
transcription factor
AP-2 (AP-2) gene, 
alternative exon la, 
and isoform 2, partial 
cds. 


0.59


2459419 


(AC002332) hypothetical 
protein [Arabidopsis thaliana] 


9.4


323 


Z71466 


S.cerevisiae 
chromosome XIV 
reading frame ORF 
YNL190w 


0.59 


3875542 


(Z67990) Similarity to Rat 
amiloride-sensitive sodium 
channel beta-subunit 


7.3 


324 


Z66493 


Beet soil-borne virus 
genes for 13K,22K 
and 48K proteins


0.59


2119867


cryV465 protein - Bacillus
thuringiensis thuringiensis]


7.2


325 


L41351 


Homo sapiens 
prostasin mRNA,
complete cds


0.59


729212 


CRYSTALLIN J1C crystallin
[Tripedalia cystophora] 


4.2 


326 


X79854 


S.lincolnensis gene 
for 16S ribosomal 
RNA 


0.59 


3702828 


(AF056577) high mobility 
group protein 1.2 


3.2 


327 


AJ223356 


Strongylocentrotus 
purpuratus mRNA for
SuDp98 protein 


0.59 


2495704 


HYPOTHETICAL PROTEIN 
KIAA0129 product is novel. 
[Homo sapiens] 


2.5 


328 


X86019


H.sapiens mRNA for 
PRPL-2 protein 


0.59 


1743341 


(Y10027) transcription factor
TEF-1 [Mus musculus] 


2.5 


329 


U75528 


Xiphias gladius 
creatine kinase gene, 
partial cds 


0.59 


1845995 


(U69477) envelope glycoprotein 
[Human immunodeficiency virus 
type 1] 


2.4 


330 


AC005573 


Homo sapiens 
chromosome 5, PAC 
clone 202el3 


0.59 


2506366 


DNA POLYMERASE
EPSILON SUBUNIT B DNA-
directed DNA polymerase (EC
2.7.7.7) II chain B - yeast
(Saccharomyces cerevisiae)
>gi|786319 (U25842) DNA
Polymerase epsilon, subunit B
(Swiss Prot. accession number
P24482) [Saccharomyces
cerevisiae]


1.4 


331 


L19180 


Rat receptor-linked 
protein tyrosine 
phosphatase 


0.59 


1235974 


(X96713) collagen [Globodera 
pallida] 


1.1


332 


L32090


Listeria
monocytogenes secA
gene, complete cds. 


0.59 


2291129 


(AF016415) No definition line
found [Caenorhabditis elegans]


0.83


333


U24433


Xenopus laevis
syndecan-2 mRNA,
complete cds.


0.59


3355692


(AL031124) hypothetical
protein SC1C2.25c
[Streptomyces coelicolor]


0.64 


334 


M23412 


Drosophila 
muscarinic 

acetylcholine receptor
mRNA, complete cds.


0.59 


168237 


(M76546) hydroxyproline-rich 
protein [Helianthus annuus] 


0.22 


335


AF060729 


Synaphea media 
chloroplast atpB-rbcL 
intergenic spacer 
region, partial 
sequence 


0.59 


731596 


HYPOTHETICAL 67.5 KD
PROTEIN IN PRPS4-STE20
INTERGENIC REGION
>gi|626567|pir||S46825
hypothetical protein YHL010c -
yeast (Saccharomyces
cerevisiae) >gi|2289881
(U11582) No definition line
found [Saccharomyces
cerevisiae]


0.16 


336 


AF029734 


Xanthobacter 
autotrophicus
transcriptional 
activator AldR (aldR) 
gene, partial cds; and 
NAD-dependent 
chloroacetaldehyde
dehydrogenase (aldB) 
gene, complete cds 


0.59 


2498801 


PERIAXIN
>gi|2143901|pir||I58157 periaxin
- rat >gi|505297 (Z29649)
periaxin [Rattus norvegicus]


0.13 


337 


X95307 


C.reinhardtii LI818r-
1 gene


0.59 


1723781 


HYPOTHETICAL [illegible] KD
PROTEIN IN [illegible]
INTERGENIC REGION
>gi|2131717|pir||S64612
hypothetical protein YGR277c -
yeast (Saccharomyces
cerevisiae)
>gi|1323505|gnl|PID|e243248
(Z73062) ORF YGR277c
[Saccharomyces cerevisiae]


1e-04


338 


M24572 


Dictyostelium
discoideum tRNA- 
Glu-GAA gene, clone 
yGluGAAS. 


0.59 


1176186 


HYPOTHETICAL 43.3 KD 
GTP-BINDING PROTEIN IN
DACB-RPMA INTERGENIC
REGION >gi|606121coli]


3e-06 


339 


U73733 


Human hMSH6 gene,
exon 2


0.59 


2665637 


(AF0310S7) mismatch repair
protein MSH6 [Mus musculus]


5e-07 



340


D90747


Escherichia coli
genomic DNA. (25.2 -
25.6 min)


0.59 


134286 


DOLICHOL KINASE 


6e-08 


341 


J05211


Human desmoplakin
mRNA, 3' end.


0.59 


246796 


major centromere protein,
CENP-B [human, Peptide, 594
aa]


4e-08 


342 


L24441 


Loligo pealii kinesin
light chain mRNA, 
complete cds. 


0.59 


547800 


KINESIN LIGHT CHAIN
(KLC) sea urchin
(Strongylocentrotus purpuratus)
>gi|161530


5e-14 


343 


M25140


Human cardiac alpha-
myosin heavy chain 
(MYH6) gene, exons 
2, 3 and 4. 


0.58 


<NONE> 


<NONE> 


<NONE> 


344 


L81932 


Homo sapiens 
(subclone 9_h2 from 
P1 H21) DNA
sequence 


0.58 


<NONE> 


<NONE> 


<NONE> 


345 


AF087966 


Homo sapiens full 
length insert cDNA 
clone YU51G04


0.58 


<NONE> 


<NONE> 


<NONE> 


346


Z78574 


H. sapiens flow-sorted 
chromosome 6 TaqI 
fragment, 
SC6pA10G11


0.58 


<NONE> 


<NONE> 


<NONE> 


347 


AF068061 


Blattella germanica 
allatostatin 
neuropeptide 
precursor, gene,
complete cds


0.58 


<NONE> 


<NONE> 


<NONE> 


348 


AF015592


Homo sapiens Cdc7 
(CDC7) mRNA, 
complete cds 


0.58 


<NONE> 


<NONE> 


<NONE> 


349 


AF028006


Methanosarcina 
barkeri atp operon: 
ATP synthase beta 
subunit (atpD), ATP 
synthase epsilon 
subunit (atpC), ATP 
synthase gene 1
(atpI), ATP synthase
a subunit subunit (...


0.58 


3184291 


(AC004136) putative DNA 
polymerase III gamma subunit 


9.4 


350 


AB017032


Mus musculus gene
for pancreatic trypsin, 
complete cds 


0.58 


3170561 


(AF056704) synapsin IIIa
[Rattus norvegicus]


9.2 



351


AF081585


Dictyostelium
discoideum
developmental
protein DG1110
(DG1110) gene,
partial cds


0.58 


105417 


basic proline-rich peptide IB-8a 
human 


9.2 


352 


AF086322 


Homo sapiens full 
length insert cDNA 
clone ZD53E01 


0.58 


93026 


hypothetical protein - African 
swine fever virus (strain Malawi 
Lil-20/1) >gi|450758(X71982) 
myeloid differentiation antigen 
homologue [African swine fever 
virus] >gi|903686 (M95672) 
unknown protein 


7.1 


353 


AF088025 


Homo sapiens full 
length insert cDNA 
clone ZC19C04 


0.58 


2384644 


(U92805) thrombospondin-3 
[Xenopus laevis] 


7.0 


354 


AB002339 


Human mRNA for 
KIAA0341 gene, 
partial cds 


0.58 


2135587 


M130 antigen (cytosolic variant
2) - human 


5.4 


355 


U67548 


Methanococcus 
jannaschii section 90 
of 150 of the 
complete genome 


0.58 


2911094


(AL021957) hypothetical 
protein Rv2174 


4.2 


356 


L07868 


Homo sapiens 
receptor tyrosine 
kinase (ERBB4) 
gene, complete cds. 


0.58 


461922 


PYRUVATE
DECARBOXYLASE (8-10 NM
CYTOPLASMIC FILAMENT- 
ASSOCIATED PROTEIN) 
(P59NC) 4.1.1.1) - Neurospora 
crassa >gi|293948 (L09125) 
pyruvate decarboxylase 
[Neurospora crassa] 
>gi| 1655909. 


4.2 


357 


X03897 


Bacillus subtilis 
sigma 43 operon with 
P23-dnaE-rpoD genes 
(dnaE for DNA 
primase, rpoD for 
RNA polymerase) 


0.58 


1323704 


(U55387) similar to C. elegans
F38E1.9 gene product encoded
by GenBank Accession Number
U41996 [Cricetulus griseus]


4.1 


358 


D76419 


Desulfovibrio 
vulgaris rbo gene for 
desulfoferrodoxin and 
rub gene for 
rubredoxin, complete 
cds 


0.58


3420047 


(AC004680) putative protein 
kinase [Arabidopsis thaliana]


2.4


359


Z82174


Human DNA
sequence from
cosmid B20F6 on
chromosome 22,
complete sequence
[Homo sapiens]


0.58 


2145455 


(Y07866) catalase-peroxidase


2.4 


360 


M33642 


F.solani STI35 
protein gene, 
complete cds. 


0.58 


2896706 


(AL021897) hypothetical 
protein Rv1069c


2.4 


361 


U64873 


Mus musculus 
transforming growth 
factor alpha (TGF 
alpha) gene, partial 
cds 


0.58 


3874437 


(Z81038) predicted using 
Genefinder; cDNA EST
yk488a2.5 comes from this gene
[Caenorhabditis elegans]


1.8 


362 


AB002132


Macrophthalmus
banzai mitochondrial
DNA for 12S and
16S rRNA, partial 
and complete 
sequence 


0.58 


2960022 


(AJ224676) rho type GEF 
[Drosophila melanogaster] 


1.8 


363 


AF070070 


Caenorhabditis
elegans MutS
homolog (msh-5)
mRNA, partial cds


0.58 


4098205 


(U75S69) Omp22 [Helicobacter 
pylori] 


1.8 


364 


AF045240 


Staphylococcus 
epidermidis plasmid
pIP1629 mobilization
protein (mobC1),
(orf69-1), (mobA1),


0.58 


4218117 


(AL035353) protein (fragment)


0.62 


365 


X61637 


H.sapiens Wilms
tumor gene 1, exons 8
and 9


0.58 


2331059 


(U88211) unknown [Gallus 
gallus]


0.62 


366 


AF039312 


Moraxella catarrhalis
strain 4223 transferrin
binding protein A
(tbpA) and transferrin
binding protein B
(tbpB) genes,
complete cds; and
unknown gene


0.58 


120155 


FIBER PROTEIN
>gi|74229|pir||ERADFM fiber
protein - mouse adenovirus 1
>gi|209758 (M30594) fiber
protein [Mastadenovirus mus1]


0.27 


367 


D87463 


Human mRNA for
KIAA0273 gene,
complete cds 


0.58 


3861477 


(U94177) androgen receptor
[Pan troglodytes]


0.12 


368


U40342 


Mus musculus ninein
mRNA, complete cds.


0.58 


4115936 


(AF118223) No definition line
found [Arabidopsis thaliana]


0.004


369 


S57235 


CD68=110kda
transmembrane
glycoprotein [human,
promonocyte cell line
U937, mRNA, 1722
nt]


0.58 


2072501 


(U96113) WWP1 [Homo
sapiens] 


1e-04


370 


U39391 


Mus musculus 
serotonin 1A receptor
mRNA, complete cds. 


0.58 


1469876 


(D63481) The KIAA0147 gene
product is related to adenylyl
cyclase. [Homo sapiens]


1e-07


371 


D00056 


Monkey B-
lymphotropic
papovavirus genes for
VP-1, 2, 3 and large
T antigen, complete
and partial cds, strain
LPV-76 > ::
gb|M14494|PPMVP1
M Monkey B-
lymphotropic
papovavirus mutant
(LPV-76) PstI B
fragment encoding
VP1, VP2, VP3 and
T-antigen.


0.58 


2462069 


(AJ001774) vanadium 
chloroperoxidase 


1e-08


372 


M77182 


Amsacta
entomopoxvirus
spheroidin gene,
complete cds, and
orfs. > ::
gb|I16670|I16670
Sequence 1 from
patent US 5476781


0.58 


1730722 


HYPOTHETICAL 43.S KD
PROTEIN IN NCE3-HHT2
INTERGENIC REGION
>gi|2131871|pir||S62957
hypothetical protein YNL035c -
yeast (Saccharomyces
cerevisiae)
>gi|1301880|gnl|PID|e239670
(Z71311) ORF YNL035c
[Saccharomyces cerevisiae]


8e-14 


373 


S72579 


igloo-S=growth-
associated protein
GAP-43 homolog


0.58 


2689720 


(AF037168) DnaJ homologue 
[Arabidopsis thaliana] 


7e-14 


374 


AF018165 


Tetraodon fluviatilis
amyloid precursor
protein mRNA,
complete cds 


0.58 


3219938 


HYPOTHETICAL 34.9 KD 
PROTEIN C57A10.11C IN
CHROMOSOME I
>gi|2058378|gnl|PID|e314002
pombe] 


5e-22


375 


U81S03 


Filobasidiella 
neoformans 
translation elongation 
factor EF1-alpha
(CnTEF1) mRNA,
complete cds 


0.57 


<NONE> 


<NONE> 


<NONE> 


376 


U09781 


Candida albicans 
ATCC 18804, CBS 
562 peptide 
transporter gene, 
complete cds. 


0.57 


<NONE>


<NONE> 


<NONE> 


377 


AC002143 


Homo sapiens 
(subclone 4_b10 from
BAC H102) DNA
sequence 


0.57 


<NONE> 


<NONE> 


<NONE> 


378 


U23442 


Tetrahymena 
thermophila RR 
internal deletion 
sequence. 


0.57 


<NONE> 


<NONE> 


<NONE> 


379 


U172S9 


Mus musculus 
transcription factor 
AP-2 (AP-2) gene, 
alternative exon 1a,
and isoform 2, partial 
cds. 


0.57 


<NONE> 


<NONE> 


<NONE> 


380 


X70844 


Buzura suppressaria 
nuclear polyhedrosis 
virus gene for 
polyhedrin protein 


0.57 


<NONE> 


<NONE> 


<NONE> 


381 


AJ012159 


Homo sapiens 5T4 
oncofetal trophoblast 
glycoprotein gene 


0.57 


<NONE> 


<NONE> 


<NONE> 


382 


X76571 


H.sapiens simple 
DNA sequence region 
clone wglaS. 


0.57 


<NONE> 


<NONE> 


<NONE> 
383


AF034434


Vibrio cholerae
pathogenicity island,
putative transposase,
aldehyde
dehydrogenase
(aldA), toxR-
activated gene A
protein (tagA),
putative inner
membrane protein,
and putative zinc
metalloprotease
genes, complete cds;
and...


0.57


<NONE>


<NONE>


<NONE>


384


AB017031


Mus musculus gene
for TESP4, complete
cds


0.57


<NONE>


<NONE>


<NONE>


385


X89788


S.hispidus
mitochondrial DNA
for SSU ribosomal
RNA gene


0.57


<NONE>


<NONE>


<NONE>


386


L16921


Rat progesterone
receptor gene, 5'
untranslated region.


0.57


3323116


(AE001251) femA protein,
putative [Treponema pallidum]


8.9


387


AF027292


Homo sapiens
interferon regulatory
factor 6


0.57


259790


(S48157) DNA polymerase-
primase 180 kda subunit
[Drosophila melanogaster,
Peptide, 1490 aa]


6.7


388


AJ012581


Cicer arietinum
mRNA for
cytochrome P450


0.57


2131498


hypothetical protein YDR446w -
yeast CAI: 0.11 [Saccharomyces
cerevisiae]


5.3


389


L15363


Human transfer RNA-
Met (TRMEP1)
pseudogene, complete
gene


0.57


3228680


(AF070935) GABA receptor
subunit [Musca domestica]


5.2


390


AE000525


Helicobacter pylori
26695 section 3 of
134 of the complete
genome


0.57


1938478


(U97008) weak similarity to
family 1 of G-protein coupled
receptors [Caenorhabditis
elegans]


4.0


391


AF020189


Amblyomma
americanum
ecdysteroid receptor
(AamEcR) mRNA,
3'UTR, region 1


0.57


2072224


(U94875) p40 [Borna disease
virus]


4.0






WO 01/02568 



PCT/US00/18374 





392


X56997


Human UbA52 gene
coding for ubiquitin-
52 amino acid fusion
protein


0.57 


2960113 


(AL0[illegible]) hypothetical
protein Rv3689


4.0 


393 


AL010260


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS *** 
from contig 4-8 L, 
complete sequence 


0.57 


117233


CYTOCHROME P450 IIC14
(CYPIIC14) phenobarbital-
inducible, hepatic - rabbit P-450
[Oryctolagus cuniculus]
>gi|358265|prf||1306317A
cytochrome P450 [Oryctolagus 
cuniculus] 


3.9 


394 




Xenopus laevis 
gamma-crystallin 
(gcrv3) gene. 


0.57




GASTRULA ZINC FINGER
PROTEIN XLCGF44.2
>gi|85736|pir||S06571 finger
protein (clone XlcGF44-2) -
African clawed frog (fragment)


J.KJ 


395 


M38384 


Drosophila 
melanogaster seven in 
absentia mRNA, 


0.57


1707177


(U80454) T16A1.1
[Caenorhabditis elegans]




396 




Haemophilus
influenzae Rd section
110 of 163 of the
complete genome


0.57




IRON(III)-TRANSPORT
SYSTEM PERMEASE
PROTEIN SFUB >gi|152S61
(M33815) protein (sfuB)


7 1 


397 


X12600


Klebsiella 
pneumoniae nifX,
nifU, nifS, nifV and
nifW genes 


0.57 


2909562 


(AL021925) hypothetical
protein Rv2256c 


1.4 


398 


AB014526


Homo sapiens mRNA 
for KIAA0626 
protein, complete cds 


0.57 


482390 


insect-stage-specific protein - 
Trypanosoma cruzi >gi| 162099 
(M65021) insect stage-specific
antigen 


0.61 


399 


AF063587 


Rhodococcus fascians 
strain NRRL-B- 
15096 hypothetical 
protein gene, 
complete cds 


0.57 


4104321 


(AF034582) vesicle associated
protein [Rattus norvegicus]


0.46 


400 


L11U7 


Guinea pig estrone 
sulfotransferase gene. 


0.57 


82584 


alpha/beta-gliadin precursor 
(clone A212) - wheat


0.35


401 


V00829 


Mouse complete gene
for a mouse kallikrein
gene. Genes are mGK-
1 (complete gene)
and mGK-2 of 
hormones, e.g., 
grow... > :: 
gb|J00390|MUSKAL 
07 Mouse pseudo- 
kallikrein 2, exons 4 
and 5, and kallikrein 
1 gene, complete cds. 


0.57 


2500916 


NUCLEAR HORMONE 
RECEPTOR NOR-2 receptor 
[Rattus norvegicus]
>gi|1583604|prf||2121281A
NOR-2 protein [Rattus 
norvegicus] 


0.20 


402 


X53092 


Chicken mRNA for 
beta-2 subunit of 
neuronal nicotinic 
acetylcholine receptor 


0.57 


1072256 


(U40953) similar to matrin F/G 
(SP:Q00910) containing C4- 
type zinc-fingers 
[Caenorhabditis elegans]


0.031 


403 


L07939 


Ovis ovis granulocyte 
colony stimulating 
factor 


0.57 


3874345 


(Z[illegible]) predicted using
Genefinder; similarity to
dehydrogenases; cDNA EST
EMBL:D65800 comes from this
gene; cDNA EST
EMBL:D76184 comes from this
gene; cDNA EST
EMBL:D69322 comes from this
gene; cDNA EST
EMBL:C08158 comes f...


3e-07 


404 


U18061


Colletotrichum
gloeosporioides 
CAP20 (cap20) gene, 
complete cds. 


0.57 


2914695 


(AC003974) putative ubiquitin 
specific protease 


9e-08 


405 


Z73955


L.japonicus mRNA
for small GTP-
binding protein,
RAB11G


0.57 


112894


TUMOR NECROSIS FACTOR,
ALPHA-INDUCED PROTEIN
3 (PUTATIVE DNA BINDING
PROTEIN A20) (ZINC
FINGER PROTEIN A20)
>gi|107549|pir||A35797
probable DNA-binding protein
A20 - human >gi|177S66
(M59465) A20 


7e-08


406 


X04335 


Petunia grp-1 gene 
for glycine-rich 
protein 


0.57 


3876901 


(Z77660) Similarity to Human
enoyl-CoA hydrolase
(SW:ECHM_HUMAN); cDNA
EST EMBL:T00611 comes
from this gene; cDNA EST
yk203d10.3 comes from this
gene; cDNA EST yk203d10.5
comes from this gene; cDNA
EST yk457h5.3 comes from t...


1e-27


407 


U40718


Rattus norvegicus S-
adenosylmethionine
decarboxylase 
(AMDP2) 
pseudogene 


0.56 


<NONE> 


<NONE> 


<NONE> 


408 


M60318 


S.cerevisiae SSD1 
protein gene, 
complete cds. > :: 
gb|AR013983|AR013
983 Sequence 8 from 
patent US 5773245 


0.56 


<NONE> 


<NONE> 


<NONE> 


409 


X60057 


Nicotiana tabacum 
blp4 mRNA for 
luminal binding 
protein (BiP) 


0.56 


<NONE> 


<NONE> 


<NONE> 


410 


AF085930 


Homo sapiens full 
length insert cDNA 
clone YR55A09 


0.56 


<NONE> 


<NONE> 


<NONE> 


411 


AL010189 


Plasmodium 
falciparum DNA *** 
SEQUENCING IN 
PROGRESS*** 
from contig 3-102, 
complete sequence 


0.56 


<NONE> 


<NONE>


<NONE> 


412 


X05402 


Murine G-CSF gene 
for granulocyte 
colony stimulating
factor precursor 


0.56 


<NONE> 


<NONE> 


<NONE> 


413 


U92280 


Rattus norvegicus
regulator of G-protein
signalling 12 
(RGS12) mRNA, 
complete cds 


0.56 


<NONE> 


<NONE> 


<NONE> 


414 


US5660 


Human
papillomavirus strain
RTRX7 complete
genome


0.56 


<NONE> 


<NONE> 


<NONE>


415 


X57626 


M. javanica 
mitochondrion 
ATPase 6, and 
putative tRNA-f-Met 
and tRNA-His genes 


0.56 


<NONE> 


<NONE> 


<NONE> 


416 


AB003363 


Sus scrofa S100C
gene, complete cds 


0.56 


<NONE> 


<NONE> 


<NONE> 


417 


L42291 


Danio rerio DANA 
element, intron 4. 


0.56 


2650002 


(AE001062) conserved 
hypothetical protein 
[Archaeoglobus fulgidus]


8.7 


418 


AF031826 


Mus musculus 
leukocystatin gene, 
complete cds 


0.56 


462493 


L-LACTATE
DEHYDROGENASE 
(IMMUNOGENIC PROTEIN 
P36) >gi|479296|pir||S33362 L- 
lactate dehydrogenase (EC 
1.1.1.27) - Mycoplasma 
hyopneumoniae 


6.7 


419 


U17068


Pennisetum glaucum 
Ac-like element. 
AcL2. 


0.56 


399449 


ESCARGOT/SNAIL PROTEIN
HOMOLOG 


6.7 


420 


Z48042 


H.sapiens mRNA 
encoding GPI-
anchored protein
p137


0.56 


141232 


HYPOTHETICAL 8.7 KD 
PROTEIN (READING FRAME 
D) >gi|76316|pir||QQSA7C
hypothetical protein E-74 


6.7 


421 


AF027657 


Choristoneura
fumiferana 
entomopoxvirus 
nucleotide 
triphosphate 
phosphohydrolase I 
(NPHI) gene, 
complete cds 


0.56 


464999 


PUTATIVE
ACETYLCHOLINE
REGULATOR UNC-18
>gi|480359|pir||S36747
acetylcholine regulator unc-18 -
Caenorhabditis elegans
>gi|247392|bbs|100294 putative
acetylcholine regulator unc-18


5.1 


422 


AB011540 


Homo sapiens mRNA 
for MEGF7, partial 
cds 


0.56 


1718033 


URACIL-DNA 
GLYCOSYLASE (UDG) 
herpesvirus 2 >gi|695219 
(U20824) uracil DNA 
glycosylase


5.1 


423 


X59941 


X.maculatus NGF 
gene for nerve growth 
factor 


0.56 


1169081 


COMMON PLANT
REGULATORY FACTOR
CPRF-1 >gi|515621 (X58575)
light-inducible protein CPRF-1
[Petroselinum crispum]
>gi|149S301 (U46217) CPRF1


3.S 



424 


M72711


Rat transcriptional
repressor of myelin-
specific genes (SCIP)
mRNA, complete cds.


0.56 


501027 


(U01849) ORF2 [Trypanosoma 
brucei] 


2.3 


425 


AL023850 


Caenorhabditis 
elegans cosmid 
Y67D11A, complete 
sequence 
[Caenorhabditis 
elegans] 


0.56 


266771 


CHORISMATE MUTASE
(CM) / PREPHENATE
DEHYDRATASE (PDT) (P-
PROTEIN)
>gi|281791|pir||S26053
chorismate mutase (EC 5.4.99.5)
P / prephenate dehydratase (EC
4.2.1.51) - Erwinia herbicola
>gi|43344 


2.3 


426 


U47862 


Schistosoma mansoni 
gynecophoral canal 
protein mRNA, 
complete cds 


0.56 


2147138 


ATP synthase chain 6 - 
Platymonas subcordiformis 
mitochondrion >gi|633582 
(Z47797) ATP synthase subunit 
6 [Platymonas subcordiformis] 


2.3 


427 


V00574 


Human germ line 
gene homologous to 
bladder carcinoma 
oncogene T24 (Gene 
code c-Ha-ras-1) with 
four exons. 


0.56 


1518672 


(U60289) receptor protein 
tyrosine phosphatase psi [Homo 
sapiens]


1.7 


428 


Z71502 


X.laevis H1(0)-1
gene


0.56 


1651674 


(D90899) ferrichrome-iron 
receptor 


1.3 


429 


M37278 


R.norvegicus renin 
gene, exons 1-9. 


0.56 


2853019 


(AF045141) putative serine 
proteinase [Scirpophaga 
incertulas] 


1.0 


430 


D28878 


Thermus 

thermophilus polA 
gene for thermostable 
DNA polymerase I, 
complete cds 


0.56 


3659692 


(AF06874S) sphingosine kinase 
[Mus musculus]


0.77 


431 


Z15027


H.sapiens HLA class 
III DNA 


0.56 


1304141 


(D43758) fibrinogen A-alpha- 
chain 


0.76 


432 


M14362


Human T-cell surface
antigen CD2 (T11)
mRNA, complete cds. 


0.56 


2462979 


(Y11915) Tenascin-X [Bos
taurus] 


0.59 


433 


Z50S0I 


Z.mays mRNA for
chlorophyll a/b-
binding protein CP29


0.56 


109677 


collagen alpha 1(I) chain -
mouse >gi|50487 


0.50


434


Z38114


S.cerevisiae
chromosome XIII
cosmid 9745


0.56


140372


HYPOTHETICAL 86.0 KD
PROTEIN IN GLK1-SRO9
INTERGENIC REGION
>gi|83159|pir||S19367
hypothetical protein YCL039w -
yeast (Saccharomyces
cerevisiae)


0.35 


435 


AF052254 


Escherichia coli DNA
gyrase A (gyrA) gene, 
partial cds 


0.56 


2724126 


(AF038535) synaptotagmin VII 
[Homo sapiens] 


0.12 


436 


AF080649 


Jeguia puiugo Ij^o 
small subunit 
ribosomal RNA gene,
mitochondrial gene 
for mitochondrial 
RNA, partial 
sequence 


0.56


3913223 


CYCLIN-DEPENDENT 
KINASE INHIBITOR 1 
p21/WAF1 [Felis catus]


0.11 


437 


AJ005690 


Danio rerio mRNA 
for protein tyrosine 
kinase 


0.56 


2623830 


(AF030962) unknown
[Schistosoma mansoni] 


7e-06 


438 


U31202 


Human noggin 
(NOGGIN) gene, 
complete cds. 


0.56 


3875475 


(Z78411) F02D8.3
[Caenorhabditis elegans] 


1 C-UD 


439 


X51695 


Ovis sp. trichohyalin 
mRNA. partial 


0.56 


3386622 


(AC004665) unknown protein 
[Arabidopsis thaliana] 


1e-10


440 


U28938 


Rattus norvegicus
protein tyrosine 
phosphatase D30 
mRNA, complete cds 


0.56 


3293547 


(AF072709) putative 
oxidoreductase [Streptomyces 
lividans]


te- 1 -*> 


441 


AE001171


Borrelia burgdorferi 
(section 57 of 70) of 
the complete genome 


0.56


ZJ UJJ. 1 


(AF016452) similar to the beta 


4e-16 


442 


AF036685 


Caenorhabditis 
elegans cosmid 
C05B10


0.56 


1519671 


(U67951) contains similarity to
ATP/GTP-binding site motif A
(PS:PS00017) [Caenorhabditis
elegans]


6e-20 


443 


X01173 


Xenopus laevis 
vitellogenin gene A1
5' flanking region


0.56


i 1 1 S 1 02 


(U4155S) K02B2.3 gene 
product [Caenorhabditis
elegans]


2e-31


444 


D10911 


Mus musculus DNA 
for MS2 protein, 
complete cds 


0.55 


<NONE> 


<NONE> 


<NONE> 



445


D30010


Rice mRNA EN 117,
partial sequence


0.55 


<NONE> 


<NONE> 


<NONE> 


446 


U51991 


Escherichia coli
phosphoprotein 
phosphatase 


0.55 


<NONE> 


<NONE> 


<NONE> 


447 


M18858 


Mouse T cell receptor 
C-gamma-7.1 mRNA,
3' end. 


0.55 


<NONE> 


<NONE> 


<NONE> 


448 


U95218 


Homo sapiens T cell- 
death associated 
protein gene, 
complete cds 


0.55 


<NONE> 


<NONE> 


<NONE> 


449 


M14948


Human R-ras gene,
exon 1.


0.55 


<NONE> 


<NONE> 


<NONE> 


450 


AB002353 


Human mRNA for 
KIAA0355 gene, 
complete cds 


0.55 


<NONE> 


<NONE> 


<NONE> 


451 


L81689


Homo sapiens
(subclone 1_d6 from
P1 H54) DNA
sequence 


0.55 


<NONE> 


<NONE> 


<NONE> 


452 


M68955 


Human myristoylated 
alanine-rich C-kinase
substrate (MACS) 
gene, 5' end. 


0.55 


3322710 


(AE001220) V-type ATPase,
subunit B (atpB-1) [Treponema 
pallidum] 


5.0 


453 


X62953 


R.norvegicus mRNA 
(pJG116) with
repetitive elements 


0.55 


1076802 


extensin-like protein - maize
>gi|600118mays] 


5.0 


454 


L34630


Synechocystis sp.
mntABC transporter
system: periplasmic-
binding protein
(mntC), complete cds;
(mntA) gene,
complete cds;
membrane protein
(mntB) gene,
complete cds.


0.55 


2117632 


hydrogen dehydrogenase (EC
1.12.1.2) - Clostridium
acetobutylicum >gi|557064
(U15277) hydrogenase I
[Clostridium acetobutylicum]


5.0 


455


U43521 


Plasmodium berghei 
merozoite surface 
protein-1 gene,
complete cds 


0.55 


127654 


MYOGLOBIN 


4.9


456 


Z64937


H.sapiens CpG DNA,
clone 17g7, reverse
read cpg17g7.rt1a.


0.55 


417298 


MFS18 PROTEIN
PRECURSOR 


1 o 


457 


U10914 


Macaca mulatta clone 
irh83 T-cell receptor 
alpha chain mRNA, 
partial cds. 


0.55 


310406 


(L09212) tat protein [Simian 
immunodeficiency virus]


3.8 


458 


AF022838 


Homo sapiens 
multidrug resistance 
protein 


0.55 


1585251 


traB gene [Amycolatopsis 
methanolica] 


2.8 


459 


M35603 


Mouse Hox-3.1 gene 
and Hox-3.2-Hox-3.1 
intergenic region. 


0.55 


818849 


(U25430) nucleotide 
pyrophosphatase precursor 
[Oryza sativa]


2.0 


460 


AE001395 


Plasmodium 
falciparum 
chromosome 2, 
section 32 of 73 of 
the complete 
sequence 


0.55 


137532 


PROTEIN C2
>gi|74386|pir||WZVZB6 59K
HindIII-C protein - vaccinia
virus (strain WR) 


1.7 


461 


AE001395 


Plasmodium 
falciparum 
chromosome 2, 
section 32 of 73 of 
the complete 
sequence 


0.55 


137532 


PROTEIN C2
>gi|74386|pir||WZVZB6 59K
HindIII-C protein - vaccinia
virus (strain WR)


1.7


462 


U59736 


Human transcription 
factor (NFATc.b) 
mRNA, complete cds


0.55 


3327144 


(AB014565) KIAA0665 protein
[Homo sapiens]


0.096 


463 


U34860 


Saccharomyces 
cerevisiae origin 
recognition complex 
large subunit (ORC1) 
gene, complete cds 


0.55 


140372 


HYPOTHETICAL 86.0 KD
PROTEIN IN GLK1-SRO9
INTERGENIC REGION
>gi|83159|pir||S19367
hypothetical protein YCL039w - 
yeast (Saccharomyces 
cerevisiae) 


0.017 


464 


AF012341 


Homo sapiens 
glutaryl-CoA 
dehydrogenase 
(GCDH) gene, exons 
6, 7, 8, 9, and 10


0.55 


1166611 


(U46674) coded for by C.
elegans cDNA yk27d9.5; coded
for by C. elegans cDNA
yk27d9.3; short region of weak
homology to drosophilia 
suppressor of sable protein 


O.OOS 
465


AF004891


HIV-1 isolate QM-
CxA from Kenya,
envelope
glycoprotein C2 V3
region (env) gene,
partial cds


0.54


<NONE>


<NONE>


<NONE>


466


Y10159


D.discoideum
racGAP gene


0.54


<NONE>


<NONE>


<NONE>


467


AB001895


Homo sapiens mRNA
for B120, complete
cds


0.54


<NONE>


<NONE>


<NONE>


468


X12357


Bovine gene for
aspartyl protease
NM1 exons 3 and 4 >
lcl|X12357 Bovine
aspartyl protease
NM1 gene, exons 3
and 4.


0.54


<NONE>


<NONE>


<NONE>


469


AE001151


Borrelia burgdorferi
(section 37 of 70) of
the complete genome


0.54


<NONE>


<NONE>


<NONE>


470


X92052


H.sapiens mRNA for
T cell receptor alpha
chain


0.54


<NONE>


<NONE>


<NONE>


471


U00938


Mus musculus ileal
lipid-binding protein
gene, complete cds


0.54


1009712


(U27698) calreticulin
[Arabidopsis thaliana]


4.9


472


X68367


M.thermoformicicum
complete plasmid
pFZ1 DNA


0.54


125272


CASEIN KINASE II, ALPHA
CHAIN (CK II)
>gi|419938|pir||A43297 casein
kinase II (EC 2.7.1.-) alpha
chain - Theileria parva
>gi|161871 (M92084) casein
kinase II alpha subunit
[Theileria parva]


4.7


473


Z61098


H.sapiens CpG DNA,
clone 44c4, reverse
read cpg44c4.rt1a


0.54


4191274


(AJ131094) Xvent-1B protein
[Xenopus laevis]


3.7


474


M63962


Human gastric H,K-
ATPase catalytic
subunit gene,
complete cds.


0.54


3881648


(Z70757) similar to serine
protease inhibitor
[Caenorhabditis elegans]


3.7


475


X86019


H.sapiens mRNA for
PRPL-2 protein


0.54


164882S


(D87963) ETF-related factor-1
(ETFR-1)


2.1
476


X89010


S.glaucescens genes
strU, strX, strV and
strW for 5'-
hydroxystreptomycin
production and
transport
polypeptides


0.54 


3550345 


(AF084524) cellular repressor 
of El A-stimulated genes CREG 
[Mus musculus] 


0.25 


477 


AB007836 


Homo sapiens mRNA 
for Hic-5, partial cds


0.54 


1097213 


ORF 1 [Streptomyces 
lavendulae] 


0.15 


478


U32622 


Comamonas 
testosteroni TsaR 
(tsaR), 

toluenesulfonate 
methyl- 

monooxygenase 
oxygenase component 
component (tsaB), 
toluenesulfonate zinc- 
independent alcohol
dehydrogenase... 


0.54 


3875351 


(Z96047) DY3.6 
[Caenorhabditis elegans] 


0.006 


479 


D61394 


Arabidopsis thaliana 
gene for beta-VPE, 
complete cds 


0.53 


<NONE> 


<NONE> 


<NONE> 


480 


D61394 


Arabidopsis thaliana 
gene for beta-VPE, 
complete cds 


0.53 


<NONE> 


<NONE> 


<NONE> 


481 


Z33072 


M.capricolum DNA
for CONTIG MC097 


0.53 


<NONE> 


<NONE> 


<NONE> 


482


U45975 


Human
phosphatidylinositol
(4,5)bisphosphate 5-
phosphatase homolog
mRNA, partial cds. 


0.53 


<NONE> 


<NONE> 


<NONE> 


483 


Z71324 


S.cerevisiae 
chromosome XIV 
reading frame ORF 
YNL04Sw 


0.53 


2135586 


M130 antigen (cytosolic variant
1) - human 


2.1 


484 


L32090 


Listeria
monocytogenes secA
gene, complete cds.


0.53 


2291129 


(AF016415) No definition line 
found [Caenorhabditis elegans]


0.70 









485


D86423


Mus musculus mRNA
for HGT keratin,
partial cds


0.53


1235974 


(X96713) collagen [Globodera
pallida]


0.41 


486 


Y15969


Mus musculus V 
kappa 21-6 gene, 
partial 


0.52 


<NONE> 


<NONE> 


<NONE> 


487 


M27480 


Mus musculus (clone 
3F9) transcribed 
germline T cell 
receptor gamma chain 
(Tcr-g) mRNA, VJ4 
C4 region. 


0.52 


3875542 


(Z67990) Similarity to Rat
amiloride-sensitive sodium 
channel beta-subunit 


4.6 


488 


D87004 


Human (lambda) 
DNA for 

immunoglobulin light
chain 


0.52 


1766073 


(U37272) winged helix protein 
CWH-1 [Gallus gallus]


3.5 


489 


Z99704 


Human DNA 
sequence from 
cosmid E75B8 on 
chromosome 22, 
complete sequence 
[Homo sapiens]


0.51 


<NONE> 


<NONE> 


<NONE> 


490 


U76523 


Sambucus nigra lectin 
precursor mRNA, 
complete cds 


0.51 


<NONE> 


<NONE> 


<NONE> 


491 


U32795 


Haemophilus 
influenzae Rd section 
110 of 163 of the
complete genome 


0.50 


<NONE> 


<NONE> 


<NONE> 


492 


M14602


Human myoglobin 
gene, exon 2.


0.49 


478384 


helicase homolog G10L protein -
African swine fever virus
>gi|414091 (X72951) G10L 125
KDa protein


7.0 


493 


D87075 


Human mRNA for 
KIAA0238 gene, 
partial cds 


0.24


1938429 


(U97002) similar to
Schizosaccharomyces pombe 4-
nitrophenylphosphatase
(PNPPASE) (SP:Q00472,
NID:g5004) [Caenorhabditis 
elegans] 


2.5 


494 


U95102


Xenopus laevis 
mitotic 

phosphoprotein 90 
mRNA, complete cds


0.23 


<NONE> 


<NONE> 


<NONE> 









495


J05254


N.crassa
mitochondrial small
(19S) rRNA and Cys-
tRNA.


0.23


192150 


(L05670) clustrin [Mus 
musculus] 


5.1 


496 


X16399


Gene for glutamate 
dehydrogenase (EC 
1.4.1.4), put. bacterial
origin


0.23 



790933 


(L07867) invariant surface 
glycoprotein [Trypanosoma 
brucei] 


0.030 


497 


AE001251 


Treponema pallidum 
section 67 of 87 of 
the complete genome 


0.22 


<NONE>


<NONE> 


<NONE> 


498


AF026919


Homo sapiens 
amyloid lambda light 
chain variable region 
mRNA, partial cds 


0.21 


<NONE> 


<NONE> 


<NONE> 


499 


Z27247


D.melanogaster 
mRNA for defensin 


0.21 


<NONE> 


<NONE> 


<NONE> 


500 


Y15608 


Candida albicans 
UBI3 gene


0.21 


<NONE> 


<NONE> 


<NONE> 


501 


V00598 


Human beta-tubulin 
pseudogene.


0.21 


<NONE>


<NONE> 


<NONE> 


502 


X79426 


A.thaliana 
microsatellite
[repeated motif 
(sat)71 


0.21 


<NONE>


<NONE> 


<NONE> 


503 


X75772 


A.caerulescens 
mitochondrial genes 
for cytochrome b and 
NADH 

dehydrogenase 5 


0.21 


139626 


PROTEIN Tl PRECURSOR 


7.S 


504 


AF028736 


Serratia marcescens 
site specific 
recombinase 


0.21 


3645960 


(AL031583) 1-
evidence=predicted by content;
1-method=genefinder;084; 1-
method_score=47.46; 1-
evidence_end; 2-
evidence=predicted by match; 2-
match_accession=SWISS-
PROT:P23792; 2-
match_description=DISCONNE
CTED PROTEIN.; 2-matc...


4.6 


505 


X97545 


S.cerevisiae OST5
gene


0.21 


2275631 


(AF014940) No definition line 
found [Caenorhabditis elegans] 


2.7 



506



M24543



Human prostate- 
specific antigen (PA) 
gene, complete cds. 



0.21 






1938527 



(U97012) C04E6.2 gene
product [Caenorhabditis
elegans]






2.7 



507



M62470



Mouse
thrombospondin
(THBS1) gene,
complete cds.



0.21 



548563 



RNA REPLICASE
POLYPROTEIN 2.7.7.48) -
Erysimum latent virus 
>gi|3892232 (AF098523) 
replicase protein [Erysimum 
latent virus] 



2.1 



508 



Y13544



Homo sapiens cosmid 
CI 



0.21 



1235710 



(L40584) polyprotein 
[Infectious pancreatic necrosis 
virus] 



2.0 



509



M24193



Chicken MHC B
complex protein (C12.3)
mRNA, complete
cds.



0.21



3600102



(AF090441) extracellular reelin
[Gallus gallus]



0.52



510



X97161



H. sapiens TFE3 gene
exon 4,5 & 6



0.21



854065



(X83413) U88 [Human
herpesvirus 6]



0.30



511



X67649



R.norvegicus DNA
sequence for
LFB1/HNF1
promoter



0.21



3913114



TRANSCRIPTION FACTOR
COUP 2 COUP-TFII - chicken
>gi|392817 (U00697) orphan
receptor COUP-TFII [Gallus
gallus]



0.004



512



U63807



Fugu rubripes growth
hormone (GH) gene,
complete cds



0.21



3510505



(AF030881) pol polyprotein
[Fugu rubripes]



3e-04



513



Z95636



H.sapiens mRNA for
laminin alpha 5 chain



0.21



400350



NAM7 PROTEIN (NONSENSE
MEDIATED MRNA DECAY
PROTEIN 1) (UP-
FRAMESHIFT SUPPRESSOR)
factor NAM7 - yeast
(Saccharomyces cerevisiae)
>gi|4023



1e-07



514



U91907



Mirounga leonina
major
histocompatibility
complex class II
(DQA) gene, partial
cds



0.20



<NONE>



<NONE>



<NONE>



515 



Z35758 



Transmissible 
gastroenteritis virus 
TFI virion protein 
genes



0.20 



<NONE> 



<NONE> 



<NONE> 



516



X00334



Drosophila virilis 
simple DNA 
sequence (pDv-19) 



0.20 



<NONE> 



<NONE> 



<NONE> 
















517


M76741


Homo sapiens biliary 
glycoprotein (BGP)
gene, partial cds. 


0.20 


<NONE> 


<NONE> 


<NONE> 


518


D78515 


Mus musculus rae28
gene, exon 1 and
5' flanking region


0.20 


<NONE> 


<NONE> 


<NONE> 


519 


M62975 


Drosophila 
melanogaster RNA
polymerase II second 
largest subunit 
upstream (DmRP 
140) gene, exons 1-4. 


0.20


<NONE> 


<NONE>




520 


M27260 


Chicken 78-kD 
glucose- regulated 
protein, complete cds. 


0.20 


<NONE> 


<NONE>


<NONE>


521


AF076470 


Rice tungro 
bacilliform virus 
Serdang strain, 
complete genome 


0.20 


<NONE> 


<NONE>


<NONE> 


522 


AF076470 


Rice tungro 
bacilliform virus 
Serdang strain, 
complete genome


0.20 


<NONE> 


<NONE> 


<NONE> 


523 


U04636


Human 

cyclooxygenase-2 
(hCox-2) gene, 
complete cds.






<NONE> 


<NONE> 


524 


AE001430 


Plasmodium 
falciparum 
chromosome 2, 
section 67 of 73 of 
the complete 
sequence 


0.20 


<NONE> 


<NONE> 


<NONE> 


525 


AF043514 


Mus musculus 
phosphomannomutase 
(Pmm2) mRNA. 
complete cds 


0.20 


3025006 


HYPOTHETICAL 113 KD 
PROTEIN IN MOAE-RHLE 
INTERGENIC REGION
>gi|1787009 (AE000181) orf, 
hypothetical protein 
[Escherichia coli] 


9.S 



















526 


U23144 


Xenopus laevis FTZ-
F1-related nuclear
orphan receptor
variant (xFF1rAshort)
mRNA, complete cds. 


0.20 


3184402 


(AB014477) period protein
[Chymomyza costata] 


9.6 


527 


U14621


Paracentrotus lividus 
Pax-6 (suPax-6) 
mRNA, complete cds.


0.20 


465894 


PROBABLE MICROSOMAL
SIGNAL PEPTIDASE 23 KD 
SUBUNIT (SPC22/23) 
>gi|630688|pir||S44854 
K12H4.4 protein - 
Caenorhabditis elegans 
>gi|289708 (L14331) homology
with signal peptidase; coded for 
by C. elegans cDNAs GenBank: 
M79661, M79662 and M79663; 
putative 


7.7 


528 


AF030511 


pleuropneumoniae 
MRP ATPase 
homolog (mrp) gene, 
partial cds; ApxIVA 
var3 (apxIVA) gene,
complete cds; and
beta-galactosidase
(lacZ) gene, partial 
cds 


0.20 


1175966 


HYPOTHETICAL 45.3 KD 
PROTEIN IN THI5 5'REGION 
>gi|1084720|pir||S56193 
probable membrane protein 
YFL062w - yeast 
(Saccharomyces cerevisiae)


7.2 


529 


AF070581 


Homo sapiens clone 
24540 mRNA 
sequence 


0.20 


542394 


glyoxal oxidase (EC 1.2.3.-) 
precursor - basidiomycete 
(Phanerochaete chrysosporium)
>gi|1050302


5.8 


530 


X75437 


T.maritima pgk gene
for 3- 

phosphoglycerate 
kinase 


0.20 


825648 


£3453 1) coproporphyrinogen 
oxidase [Homo sapiens] 


5.8 


531 


U32686 


Haemophilus 
influenzae Rd section 
1 of 163 of the
complete genome 


0.20 


3309593 


(AF072878) ciliary outer arm 
dynein beta heavy chain


5.6 


532 


Z28081


S.cerevisiae 
chromosome XI 
reading frame ORF
YKL081w


0.20 


2507201 


CARBON CATABOLITE 
DEREPRESSING PROTEIN
KINASE >gi|1469S03 (L78129)
serine/threonine kinase [Candida 
albicans] 


5.5 






533


AF022725


Hordeum vulgare
limit dextrinase
(HvLD99) gene,
complete cds


0.20 


3139154 


(AF064077) adrenocorticotropic 
hormone receptor [Sus scrofa] 


4.3 


534 


AL021726 


Drosophila 
melanogaster cosmid 
171E4 


0.20 


3885334 


(AC005623) putative argonaute 
protein [Arabidopsis thaliana]


2.6 


535 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds


0.20 


4008334 


(Z92824) B0413.4
[Caenorhabditis elegans]


1.5 


536 


Z46606 


H. sapiens HLTF gene 
for helicase-like
transcription factor 


0.20 


132946 


60S RIBOSOMAL PROTEIN
L30B (RP29) cytosolic - yeast
(Saccharomyces cerevisiae)
>gi|171821 not determined)
[Saccharomyces cerevisiae]
>gi|1045254 cerevisiae]
>gi|1323250|gnl|PID|e243708
(Z72933) ORF YGR148c
[Saccharomyces cerevisiae]


1.5 


537 


X87193 


H. sapiens mRNA for 
2 19 gene


0.20 


139820 


DNA-REPAIR PROTEIN 
XRCC1 


1.5 


538 


L77965 


Clostridium 
perfringens C beta 2 
toxin gene, complete 
cds 


0.20 


1175950 


HYPOl'HtllCAL 3J.DKJL) 
PROTEIN IN SEC53-ACT1 
INTERGENIC REGION 
>gi|1084703|pir||S56211
probable membrane protein
YFL044c - yeast
(Saccharomyces cerevisiae)
>gi|836711|gnl|PID|d1009835
(D50617) YEL044C 


1.4 


539 


M15938 


Chicken neural cell- 
adhesion molecule (N-
CAM) gene, exon 18.


0.20 


2133082 


regulatory protein MSR1 - yeast


1.1


540 


AJ003220 


Solanum tuberosum 
mRNA for extensin- 
like protein, partial 


0.20 


2496932 


HYPOTHETICAL KD 
PROTEIN C56G2.1 IN 
CHROMOSOME III
>gi|726413 (U23177) C56G2.1
gene product [Caenorhabditis 
elegans] 


1.1 


541 


X98103 


A.thaliana psbP gene


0.20 


119227


EPIDERMAL GROWTH 
FACTOR PRECURSOR 
precursor - mouse >gi|309210 
(J00380) prepro-egf [Mus
musculus] 


0.49 
















542 


AB011179 


Homo sapiens mRNA 
for KIAA0607 
protein, partial cds 


0.20 


2143753 


gene VGF protein - rat 
>gi|205690(M60525) nerve 
growth factor inducible protein 
[Rattus norvegicus] >gi|205701
(M60522) nerve growth factor- 
inducible protein [Rattus 
norvegicus] >gi)2 f J765 1 


0.39 


543 


X75318


H.sapiens ITIH1 gene
(exon 22) and ITIH3
gene 


0.20 


629557 


RNA-binding protein mpD -
Arabidopsis thaliana (fragment)
>gi|510240 (X61108) RNA
binding protein [Arabidopsis
thaliana] 


0.38 


544 



AB008374 


Oncorhynchus mykiss 
mRNA for alpha 3 
type I collagen, 
partial cds 


0.20 


1082610 


mufl protein - human 
>gi|762953 (X86018) mufl 
[Homo sapiens] 


0.37 


545 


U09809 


Limulus polyphemus 
arginine kinase 
mRNA, complete cds. 


0.20 


3882016 


(AJ012650) CP [Papaya 
ringspot virus] 


0.37 


546 


AB020671 


Homo sapiens mRNA 
for KIAA0864 
protein, partial cds 


0.20 


2674350 


(U93121)M-phase 
phosphoprotein-1 [Homo 
sapiens] 


0.IS 


547 


L04457 


Phytophthora
megasperma
mitochondrial
ORF152, complete
cds, cytochrome c
oxidase subunit I
(cox1) gene,
complete cds, 
cytochrome c oxidase 
subunit II 


0.20 


746516 


(U23517)D1022.7 
[Caenorhabditis elegans] 
>gi|3258651 elegans] 


0.043 


548


L04457 


Phytophthora
megasperma
mitochondrial
ORF152, complete
cds, cytochrome c
oxidase subunit I
(cox1) gene,
complete cds,
cytochrome c oxidase 
subunit II 


0.20 


746516 


(U23517) D1022.7
[Caenorhabditis elegans]
>gi|3258651 elegans]


0.043


549


S82819


Cdk5r=cyclin-
dependent kinase 5
regulatory subunit
p35 [mice, brain,
129/SvJ, C57BL/6,
Genomic/mRNA,
5523 nt]


0.20


3413870


(AB007923) KIAA0454 protein
[Homo sapiens]


0.020


550


D31792


Streptomyces griseus
DNA for
serine/threonine
protein kinases,
complete cds


0.20


861405


(U29154) T07F12.2 gene
product [Caenorhabditis
elegans]


0.019


551


U97499


Homo sapiens
butyrophilin (BT3.2)
gene, exons 5-10, and
complete cds


0.20


2773341


(AF040954) putative protein
phosphatase 1 nuclear targeting
subunit [Rattus norvegicus]


0.008


552


U31463


Rattus norvegicus
nonmuscle myosin
heavy chain-A
mRNA, complete cds


0.20


3880111


(Z81130) predicted using
Genefinder


0.002


553


X78401


Bacteriophage P22
right operon, orf 48,
replication genes 18
and 12, nin region
genes, ninG
phosphatase, late
control gene 23, orf
60, complete cds, late
control region, start
of lysis gene 13


0.20


1123087


(U42436) C49H3.3 gene
product [Caenorhabditis
elegans]


4e-04


554


X57310


Nocardia
lactamdurans pcbAB
and pcbC genes for
alpha-aminoadipyl-L-
cysteinyl-D-valine
synthetase and
isopenicillin N
synthase


0.20


1723511


PUTATIVE ENDONUCLEASE
C1F12.06C yeast
(Schizosaccharomyces pombe)
>gi|!2I79S0 (Z69944) unknown
[Schizosaccharomyces pombe]


4e-09


555


X62386


S.epidermidis genes
epiY', epiY, epiA,
epiB, epiC, epiD,
epiQ, epiP


0.20


3874927


(Z73424) C44B9.1
[Caenorhabditis elegans]


3e-10












{ZJ&kjizu) similar to nucleotide 




556 


X59000 


Epizootic 

haemorrhagic disease 
virus gene segment 6 
for NS1 


0.20 



3879755 


binding protein; cDNA EST
EMBL:M75897 comes from this
gene; cDNA EST
EMBL:M89054 comes from this
gene; cDNA EST
EMBL:D26713 comes from this
gene; cDNA EST
EMBL:D26718 comes from this
gene; cDNA...


8e-16


557 


M98776 


Human keratin 1 
gene, complete cds 


0.20 


1086900 


(U41278) contains similarity to 
G beta repeats 


2e-30 


558 


AF011446


Mus musculus 
granzyme K gene, 
complete cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


559 


AF074708 


Macaca mulatta clone 
MMU1.5 FRG1-like
pseudogene, exons 7
and 8, partial
sequence 


0.19 


<NONE> 


<NONE> 


<NONE> 


560 


X13287 


Medicago sativa 
nodulin-25 gene


0.19 


<NONE> 


<NONE> 


<NONE>


561 


Z49509 


S.cerevisiae 
chromosome X 
reading frame ORF
YJR009c 


0.19 


<NONE> 


<NONE> 


<NONE> 


562 


D89041


Bovine DNA for 
prostaglandin 
F2alpha receptor, 
partial cds 


0.19 


<NONE> 


<NONE> 


<NONE> 


563 


D29644 


Streptococcus 
salivarius DNA for 
dextranase 


0.19 


<NONE> 


<NONE> 


<NONE> 


564 


AE001461


Helicobacter pylori, 
strain J99 section 22 
of 132 of the 
complete genome 


0.19 


<NONE> 


<NONE> 


<NONE> 


565 


L38559


Homo sapiens 
galactocerebrosidase 
(GALC) gene, exon 
17. 


0.19 


<NONE> 


<NONE> 


<NONE> 


566 


Z8262S 


R.prowazekii 
genomic DNA 
fragment (clone 
A405F) 


0.19 


<NONE> 


<NONE> 


<NONE>



567


U25641


Tetrahymena
thermophila
telomerase
component p80
mRNA, complete cds


0.19


<NONE>


<NONE>


<NONE>


568


AB002343


Human mRNA for
KIAA0345 gene,
complete cds


0.19


<NONE>


<NONE>


<NONE>


569


D10064


Erwinia carotovora
gene for pectate lyase
III, complete cds


0.19


<NONE>


<NONE>


<NONE>


570


U31734


Homo sapiens clone
MFU8 A4A10
hypoxanthine
phosphoribosyltransferase
(hprt) 130 kb
deletion mutant
mRNA, partial cds,
contains human Alu
element


0.19


<NONE>


<NONE>


<NONE>


571


AE001386


Plasmodium
falciparum
chromosome 2,
section 23 of 73 of
the complete
sequence


0.19


<NONE>


<NONE>


<NONE>


572


M95623


Homo sapiens
hydroxymethylbilane
synthase gene,
complete cds.


0.19


<NONE>


<NONE>


<NONE>


573


S67478


(GC*1S)=vitamin D-
binding protein/group
specific component
[human, peripheral
blood leukocytes,
Genomic, 794 nt,
segment 4 of 9]


0.19


<NONE>


<NONE>


<NONE>


574


X99075


H.sapiens NRGN
gene, exon 1


0.19


<NONE>


<NONE>


<NONE>


575


AF044775


Homo sapiens
breakpoint cluster
region BCRder14
sequence


0.19


<NONE>


<NONE>


<NONE>






576


AB002333


Human mRNA for
KIAA0335 gene,
complete cds


0.19 


<NONE> 


<NONE> 


<NONE> 


577 


U53566 


Macaca mulatta pit-
1/GHF-1
transcription factor
mRNA, complete cds


0.19 


1078068 


probable membrane protein 
YLR311c - yeast


9.2 


578 


U73664 


Human 

t(11;14)(q13;q32)
breakpoint junction 
sequence 


0.19 


116734 


COAT PROTEIN (CAPSID
PROTEIN) virus >gi|58901
(X62133) CyMV coat protein 
gene product 


8.8 


579 


AF004054 


Heterophyllaea 
pustulata rps16 gene,
chloroplast gene, 
partial intron 
sequence 


0.19 


1928991 


(U92815) heat shock protein 70 
precursor [Citrullus lanatus] 


8.7 


580 


Z27081


Caenorhabditis 
elegans cosmid 
M01A8, complete
sequence 
[Caenorhabditis 
elegans] 


0.19 


2496247 


HYPOTHETICAL ATP- 
BINDING PROTEIN MJ0625 
>gi|2128413|pir||A6437S
hypothetical protein MJ0625 -
Methanococcus jannaschii
>gi|1591336 (U67510) M.
jannaschii predicted coding
region MJ0625


8.6 


581 


Z74145 


S.cerevisiae 
chromosome IV 
reading frame ORF 
YDL097c 


0.19 


1174425 


TYROSINE-PROTEIN
KINASE SPK-1 


6.7 


582 


D38547 


Small round 
structured virus 
genomic RNA, 
3' terminal sequence
containing ORF2 and 
ORF3 


0.19 


971318 


(Z48053) putative protein 
[Bovine herpesvirus 1]


5.1 









583


D88000


Ralstonia eutropha
DNA for 16S ribosomal
RNA > ::
dbj|D88002|D88002
Ralstonia eutropha
DNA for 16S
ribosomal RNA > ::
dbj|D88003|D88003
Ralstonia eutropha
DNA for 16S
ribosomal RNA > ::
dbj|D88004|D88004
Ralstonia eutropha
DNA for 16S
ribosomal RNA


0.19 



3800952 


(AF100657) No definition line 
found [Caenorhabditis elegans] 


5.1 


584 


U67462 


Methanococcus 
jannaschii section 4 
of 150 of the 
complete genome 


0.19 


3183617 


(AJ005586) MYB-related
transcription factor 
[Antirrhinum majus] 


4.0 


585 


L23906 


Gallus domesticus 
microsatellite DNA 
marker. 


0.19 


1947094 


(U93074) voltage-gated sodium 
channel homolog BdNa1


3.9 


586 


AE001462 


Helicobacter pylori, 
strain J99 section 23 
of 132 of the 
complete genome 


0.19 


1730177 


GLUCOSE-6-PHOSPHATE 
ISOMERASE (GPI) 
SOMERASE) (PHI) 
>gi|2118333|pir||I4S073 glucose 
phosphate isomerase - Chinese 
hamster >gi|987046 griseus]


3.9 


587 


M19460


P.putida catBC 
operon encoding 
cis,cis-muconate
lactonizing enzyme I
and muconolactone 
isomerase genes, 
complete cds. 


0.19 


3873843 


(Z8^25b)cUNA hSl 
yKZj ig / .j comes irom mis 
gene; cDNA EST yk251g7.5 
comes from this gene; cDNA 
EST EMBL:D68223 comes
from this gene; cDNA EST
EMBL:C12737 comes from this
gene; cDNA EST yk389c8.5 
comes from this gene; cDNA 
E... 


3.9 


588 


U22349


Tetrahymena australis
telomerase RNA
gene, complete
sequence 


0.19 



4105782 


(AF049922) PGP169-12
[Petunia x hybrida]


3.2 
















589


L27745 


Homo sapiens voltage
operated calcium
channel, alpha-1
subunit mRNA,
complete cds.


0.19 


3763926 


(AC004450) unknown protein 
[Arabidopsis thaliana] 


3.0 




590


AF049588


Canis familiaris
synapsin I gene,
partial cds 


0.19 


4104931 


(AF042196) auxin response
factor 8 [Arabidopsis thaliana] 


3.0 


591 


X06627 


Staphylococcus 
aureus plasmid pS194
sequence 


0.19 


137927 


PRE-NECK APPENDAGE
PROTEIN (LATE PROTEIN
GP12) >gi|75856|pir||WMBP22
gene 12 protein - phage phi-29
>gi|215330 (M14782) pre-neck
appendage protein
[Bacteriophage phi-29]
>gi|225367|prf||1301270G gene
12 [Bacteriophage phi-29]


2.3 


592 


X61597 


M.musculus gene for 

kallikrein-binding 

protein 


0.19 


2982874 


(AE000675) cobalamin
synthesis related protein CobW 


1.7 


593 


AF016242


Dictyostelium 
discoideum protein 
synthesis elongation 
factor 1-alpha (tefZ)
gene, partial cds


0.19 


133659 


PUTATIVE RNA-DIRECTED
RNA POLYMERASE 


1.4 


594 


AF004447 


Venezuelan equine 
encephalitis virus 

polyprotein gene, 
partial cds > :: 
gb|AF004460|AF004
460 Venezuelan
equine encephalitis 
virus strain 1385 
polyprotein gene, 
partial cds 


0.19 


4096173 


(U25968) early embryogenesis
protein [Oryza sativa] 


1.3 


595 


J04821 


Human elastin (ELN) 
gene, exon 1, clones
HELC-5 and HELC- 
6. 


0.19 


1170523 


INHIBIN BETA B CHAIN 
PRECURSOR inhibin precursor 
- bovine >gi|563753 (U16241)
betaB inhibin/activin precursor 
[Bos taurus] 


1.3 


596 


AF059650 


Homo sapiens histone 
deacetylase 3 
(HDAC3) gene, 
complete cds


0.19 


3024881 


PROBABLE TRANSPORT 
PROTEIN CY21C12.U 
>gi|2078066|gnl|PID|e315171
(Z95210) betP 


0.83 












597


M69053


D.melanogaster
calcium-activated K+
channel subunit


0.19


FERREDOXIN-DEPENDENT
GLUTAMATE SYNTHASE 1
(FD-GOGAT)
>gi|2126524|pir||S60228
glutamate synthase (ferredoxin)
(EC 1.4.7.1) gltB -
Synechocystis sp. (PCC 6803)
>gi|515938 (X80485) glutamate
synthase


U-Ov 


598 


AF076279 


Dictyostelium 
firmibasis plasmid
Dfpl, complete 
plasmid sequence 


0.19


453986 


(U00008) yejA [Escherichia 
coli]


0.79 


599 


D28873 


Mouse MCNP gene 
for C-type natriuretic 
peptide, complete cds 
(exon1, exon2)




ZOJU444 


(AE001092) acetyl-CoA
synthetase (acs-1)
[Archaeoglobus fulgidus] 




600


U06071


Oxytricha nova 
macronuclear actin II 
gene, complete cds. 


0.19 


1584024 


complement control protein
[Botryllus schlosseri] 


0.48 


601 


L54057 


Homo sapiens CLP 
mRNA, partial cds. 


0.19 


3036883 


(AL022374) putative ABC
transporter 


0.46 




602


XS9806


P.lividus cDNA for
COLL2alpha gene 


0.19 


3638957 


(AC004877) sco-spondin-mucin- 
nice, similar 10 ryoio* uncertain 
[Homo sapiens]


0.41 


603 


AE001104


Archaeoglobus 
fulgidus section 3 of 
172 of the complete
genome


0.19 


2315192 


(Y11739) transcription factor
[Homo sapiens] 


0.35 


604 


U54501 


Rattus norvegicus 
microsatellite
sequence D0Mco22 


0.19 


228951 


D-MeAsp
receptor:ISOTYPE=epsilon 3
[Mus musculus] 


0.32 


605 


X74468 


Human 

papillomavirus type 
15 genomic DNA 


0.19 


3695390 


(AF096371) contains similarity 
to Rattus norvegicus cyclin G- 
associated kinase (SW:P97874) 
[Arabidopsis thaliana] 


0.28 


606 


U20285 


Human Gps1 (GPS1)
mRNA, complete cds


0.19 


2582659 


(AJ002527) glucitol-6- 
phosphate dehydrogenase 
[Clostridium beijerinckii] 


0.27 


607 


D49408 


Human gene for 
interleukin 3 receptor 
alpha subunit, exon
10 


0.19 


2522368 


(AF008596) alpha 1,3-
fucosyltransferase [Helicobacter
pylori]


0.16 



















608 


AF041U1 


Homo sapiens 
pituitary specific 
homeodomain protein 
(PROP1) gene, exon
3 and complete cds 


0.19 


37403 


(X03541) trk gene product (aa 1- 
641) [Homo sapiens] 


0.091 


609 


L12531 


Discopyge ommata 
Ca2+ channel alpha 1 
subunit gene 
sequence. 


0.19 


3618274 


(AJ223219) hypothetical protein 


0.069 


610 


AF052445 


Yellow fever virus 
clone HONG9 
polyprotein gene, 
complete cds 


0.19 


1932822 


(U15928) KH-domain putative
RNA binding protein 


0.001


611 


Z36946 


B.anthracis sap gene 
encoding S-layer 
protein 


0.19 


173241 


(L06487) ZIP1 protein 
[Saccharomyces cerevisiae] 


2e-04 


612


AF087984 


Homo sapiens full
length insert cDNA 
clone YW29A12 


0.19 


3786014 


(AC005499) hypothetical 
protein [Arabidopsis thaliana] 


1e-06


613 


AE001010


Archaeoglobus 
fulgidus section 97 of
172 of the complete 
genome 


0.19 


3135493 


(AF060248) unknown 
[Arabidopsis thaliana] 


7e-08 


614 


L08965 


Trichosporon 
cutaneum carbamoyl 
phosphate synthetase 
large subunit (argA)
gene, partial cds. 


0.19 


1086901 


(U41278) F33G12.3 gene
product [Caenorhabditis
elegans]


2e-08 


615 


M91466 


Rattus norvegicus 
A2b-adenosine 
receptor mRNA,
complete cds. 


0.19 


2984320 


(AE000773) acetoin utilization 
protein [Aquifex aeolicus] 


6e-09 


616 


X95971 


S.lividans groEL2 
gene


0.19 


3925277 


(AL03264:!) similar to 
Uncharacterized protein family 
UPF0034, Double-stranded
RNA binding motif; cDNA EST 
yk489b3.5 comes from this 
gene; cDNA EST yk439g7.5 
comes from this gene 
[Caenorhabditis elegans]


7e-10


617 


U12539


Schizosaccharomyces 
pombe scd2 (scd2) 
gene, complete cds. 


0.19 


1938549


(U97016) similar to drosophila 
Rlc1 gene product ribosomal
protein L4 (YML4) 
(NID:g459259) 


3e-14












618


U12539


Schizosaccharomyces
pombe scd2 (scd2)
gene, complete cds.


0.19


1938549


(U97016) similar to drosophila
Rlc1 gene product ribosomal
protein L4 (YML4)
(NID:g459259)


9e-15 












619


Z68327


Human DNA
sequence from
cosmid U25D11,
between markers
DXS366 and DXS87
on chromosome X.


0.19


3875774


EMBL:D32434 comes from this
gene; cDNA EST
EMBL:D33710 comes from this
gene; cDNA EST
EMBL:D34467 comes from this
gene; cDNA EST
EMBL:D35005 comes from this
gene; cDNA EST
EMBL:D37535 comes from this
gene; ...
>gi|3878710|gnl|PID|e1348373
EST EMBL:D33710 comes
from this gene; cDNA EST
EMBL:D34467 comes from this
gene; cDNA EST
EMBL:D35005 comes from this
gene; cDNA EST
EMBL:D37535 comes from this
gene; ...


6e-15 


620 


U66525 


Dictyostelium 
discoideum 
ORFveg114 mRNA,
complete cds


0.19 


3540281 


(AF056116) AIM related
protein [Fugu rubripes]


2e-17 


621 


U25830 


Newcastle disease 
virus isolate Herts/33 
matrix protein 
mRNA, complete cds 


0.19 


2228750 


(U93868) RNA polymerase III 
subunit [Homo sapiens] 


1e-18


622 


U89407 


Mus musculus strain 
BALB/c delta- 
aminolevulinic acid 
dehydratase (Lv) 
mRNA, partial cds 


0.19 


1825764 


(U88314) C46H11.11 gene
product [Caenorhabditis 
elegans] 


3e-25 


623 


AF095598 


Bison bison 
athabascae 
microsatellite BBJ 2 


0.18 


<NONE> 


<NONE> 


<NONE> 


624 


AF064260 


Strongylocentrotus 
purpuratus SRC8
mRNA, complete cds


0.18 


<NONE> 


<NONE> 


<NONE>



















625 


U69533 


Arabidopsis thaliana 
complete cds 


0.18 


<NONE> 


<NONE> 


<NONE> 


626 


D89041 


Bovine DNA for 
prostaglandin 
F2alpha receptor, 
partial cds 


0.18 


<NONE> 


<NONE> 


<NONE> 


627 


M24571


Dictyostelium
discoideum tRNA-
Glu-GAA gene, clone
yGluGAA7.


0.18 


<NONE> 


<NONE> 


<NONE> 


628 




D.melanogaster ovo
gene required for
female germ line
development


0.18


<NONE>


<NONE>


<NONE>


629 


AL010209


Plasmodium 
falciparum DNA ***
SEQUENCING IN
PROGRESS ***
from contig 3-104, 
complete sequence 


0.18 


<NONE> 


<NONE> 


<NONE> 


630 


U67575 


Methanococcus 
jannaschii section 117
of 150 of the 
complete genome 


0.18 


111839 


inositol 1,4,5-triphosphate
receptor 2 - rat 


8.5 


631 


U28730 


Caenorhabditis 
elegans cosmid
K10B2 


0.18 


1787604 


{PdiKjuuiji) on, nypotneiicai 
protein [Escherichia coli] 


8.3


632 


X99798 


L.lactis pepFl & 
pepF2 genes 


0.18 


3406624 


{f\rVf7i giycosomai maiaie 
dehydrogenase [Trypanosoma 
brucei] 


8.1 


633 


AF025306 


Danio rerio band 4.1- 
like protein 4 (nbl4) 
mRNA, complete cds 


0.18 


465445 


PROBABLE NUCLEAR 

ill " 1 Ji\J K-r 1 1 lll^l UWJ ▼ 1( UJ 1 \ Oil Ulll 

Kaplan) >gi|334072 (M34651)
ORF-3 protein [Pseudorabies 
virus] 


7.9 


634 


AF059251


Mus musculus
lipoxygenase (alox)
mRNA, complete cds 


0.18 


1655667 


(Z81368) hypothetical protein
Rv2393 


6.6


635 


Z22605


G.domesticus CTCF
protein mRNA.


0.18 


481S64 


3-methyl-2-oxobutanoate 
dehydrogenase 


6.6


636 


AB011086


Homo sapiens mRNA
for KIAA0514
protein, complete cds 


0.18


3874158 


(Z81464) predicted using 
Genefinder 


6.4 









Caenorhabditis 










637 


Z7 85 36 


elegans cosmid 
C07A4. complete 
sequence 
[Caenorhabditis 
elegans] 


0.18 


3702121 


(AJ011681) retinoblastoma-
related protein [Chenopodium
rubrum]


6.4 


638 


U67530 


Methanococcus
jannaschii section 72 
of 150 of the 
complete genome 


0.18 


3877946 


(Z81094) Weak similarity to 65 
KDA heat shock protein
(TR:G602231);cDNA EST 
tmoLU/ 1 comes irorn mis 
gene; cDNA EST 
EMBL:D74382 comes from this 
gene [Caenorhabditis elegans]


6.3 


639 


M63781 


Influenza
A/Duck/England/1/62
(H4N6) nucleoprotein 
mRNA, complete cds. 


0.18 


3873663 


rZbyb34) cDNA kSi * 
EMBL:D71510 comes from this
gene; cDNA EST 
EMBL:C08449 comes from this 
gene, cijna £.0 1 ytcioooi^.-? 
comes from this gene; cDNA 
col yk-iooulii.j comes from 
this gene; cDNA EST 
yk461h7.3 comes from this 
gene; cDNA... 


6.2 


640 


M73781


Oryctolagus 
cuniculus integrin
beta-8 subunit
mRNA, complete cds.
> :: gb|I44328[I44828 
Sequence 3 from 
patent US 5635601 


0.18 


1362129 


major allergen OLE 17 - 
common olive 


5.8 


641 


X67219


D.melanogaster Rop 
gene


0.18


3449286 


(AB011527) MEGF1 [Rattus
norvegicus] 


4.8 


642 


AF106941


Homo sapiens beta-
arrestin 2 mRNA,
complete cds 


0.18 


548353 


[PROTEIN-PII] 
URIDYLYLTRANSFERASE 
vinelandii >gi|39257 (X59610) 
uridylyl transferase 


3.7


643 


AF052602


Danio rerio 
huntingtin (HD) 
mRNA, complete cds 


0.18 


241058 


potential IGF binding protein 
[chickens, Peptide Partial, 77 aa,
segment 2 of 3] 


3.6


644


AB020709


Homo sapiens mRNA
for KIAA0902
protein, complete cds


0.18


3875570


(ZbMji4^ predicted using
Genefinder; cDNA EST
EMBL:M75775 comes from this
gene; cDNA EST
EMBL:M89255 comes from this
gene; cDNA EST
EMBL:M89127 comes from this
gene; cDNA EST
EMBL:T00141 comes from this
gene; cDNA EST EMBL:T...


2.1 


645 


AF096883 


HIV-1 isolate patient
3 country USA pol 
polyprotein (pol) 
gene, partial cds 


0.18 


3250696 


(AL024486) putative protein 


1.7 


646 


L39928


Pyrocoelia miyako 
(clone pB-PmL4I) 
luciferase mRNA, 
complete cds 


0.18 


2914702 


(AC003974) unknown protein 
[Arabidopsis thaliana] 


0.73 


647 


M17082


Human 

carcinoembryonic 
nonspecific 
crossreacting antigen 
(CEA; NCA) gene, 
exons 1 and 2. 


0.18 


1351833 


REGULATORY PROTEIN 
ABAA 


0.72 


648 


X75318 


H.sapiens ITIH1 gene
(exon 22) and ITIH3
gene


0.18 


629557 


RNA-binding protein mpD - 
Arabidopsis thaliana (fragment)
>gi|510240 (X61108) RNA
binding protein [Arabidopsis
thaliana] 


0.41 


649 


AF011908


Mus musculus 
apoptosis associated 
tyrosine kinase 
(AATYK) mRNA, 
complete cds 


0.18 


330442 


(K03332) nuclear antigen 2 
[Epstein-Barr virus] 


5e-04 


650 


U04004 


Simian 

immunodeficiency 
virus SIVagmVER-2
envelope protein 
gene, partial cds. 


0.18


135102 


ASPARTYL-TRNA
SYNTHETASE aspartate- 
tRNA ligase (EC 6.1.1.12) - 
Escherichia coli coli]
>gi|1736513|gnl|PID|d1016401
(D90829) Aspartate-tRNA 
ligase (EC 6.1.1.12) 
[Escherichia coli] 


6c- n 


651 


U88155 


Xenopus laevis 
RanGTPase
activating protein 


0.18


995714 


(X91258) pid:e193503
[Saccharomyces cerevisiae] 


2e-13



652


Z18921


B.oleracea gene for S-
receptor kinase-like
protein


0.18


3875535


{/.bbs i I ) similar to nooiunase;
cDNA EST EMBL:U64»S3J
comes from this gene; cDNA
EST EMBL:D65938 comes
from this gene; cDNA EST
yk280h9.3 comes from this
gene; cDNA EST yk280h9.5
comes from this gene; cDNA
EST yk223dll.3 come...


1e-19


653 


M60650 


S.cerevisiae STA2 
gene, complete cds. 


0.16 


<NONE> 


<NONE> 


<NONE> 


654 


U80912 


Eucalyptus globulus 
NADP-isocitrate 
dehydrogenase 
(EglCDH) mRNA,
complete cds 


0.16 


3766172 


(AF057298) ornithine 
decarboxylase antizyme 2 [Mus
musculus] 


4.2 


655 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.16 


76749 


hypothetical protein 4 - fowl 
adenovirus 1 


4.0 


656 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete
cds 


0.16 


30440S6 


(AF055904) unknown 
[Myxococcus xanthus]


0.60


657 


AF030231 


Glycine max sucrose
synthase (SS) mRNA, 
complete cds 


0.078 


<NONE> 


<NONE> 


<NONE> 


658 


M19133 


Woodchuck hepatitis 
virus (WHV), 
complete genome, 
clone WHV 59. 


0.072 


1076190 


cell wall glycoprotein, 75K, 
precursor - diatom 
(Cylindrotheca fusiformis)
>gi|515363 (X80394) P75K
gene product [Cylindrotheca 
fusiformis] 


6.3 


659 


U31557 


Ovine adenovirus 
IVa2 protein gene, 
DNA polymerase 
gene, terminal protein 
gene and 52.55 kDa 
protein gene, partial 
cds 


0.072 


3511143


(AF061244) unknown 
[Agrocybe aegerita]


6.2


660


AL021491


Caenorhabditis
elegans cosmid
Y44A6B, complete
sequence
[Caenorhabditis
elegans]


0.070 


<NONE> 


<NONE> 


<NONE> 


661 


M33874 


X.laevis Xotch 
protein mRNA, 
complete cds. 


0.070 


1654096 


(Y09076) RAD3 
[Schizosaccharomyces pombe] 


0.23 


662 


AB012725


Mus musculus 
ZAN75 mRNA for 
zinc finger protein, 
complete cds 


0.069 


1350800 


MITOCHONDRIAL 
RIBOSOMAL PROTEIN S5 


2.0 


663 


AL021491


Caenorhabditis 
elegans cosmid 
Y44A6B, complete 
sequence 
[Caenorhabditis 
elegans]


0.068


<NONE>


<NONE>


<NONE>


664 


Z60318


H. sapiens CpG DNA, 
clone lei, reverse 

r**nH r*r>o 1 1 r 1 i 


0.068




(U55376) F16H11.2 gene
product [Caenorhabditis
elegans]


-i.U 


665 


Z35973 


S.cerevisiae 
chromosome II 
reading frame ORF 
YBR104w 


0.068 


2493000 


PROBABLE SUCCINYL-
COA:3-KETOACID-
COENZYME A
TRANSFERASE PRECURSOR
EMBL:Z14816 comes from this
gene; cDNA EST
EMBL:Z14946 comes from this
gene; cDNA EST
EMBL:D69746 comes from this
gene; cDNA EST yk219b6.3
comes from this gene; cDNA
ES...


0.6S 


666 


Z86111 


Streptomyces lividans
rpsP, trmD, rplS,
sipW, sipX, sipY,
sipZ, mutT genes and
4 open reading
frames


0.068 


1235974 


(X96713) collagen [Globodera 
pallida]


4e-04


667 


M72980


Anthonomus grandis 
vitellogenin gene 
(VTG), complete cds. 


0.068 


3242750 


(AC005164) match to ESTs
AA731 149 (NID:g214013S), 
AA73190S (NID:g2752719). 
AA287837 (NID:gl933519), 
AA262811 (NID:glS9S3S2X 
and AAS25S20(NID:g2S99132) 


1e-59


668 


M34161 


Rat tachykinin (PPT) 
gene, exons 5 and 6. 


0.067 


<NONE> 


<NONE> 


<NONE> 


669 


L03811 


Aspergillus niger zinc 
finger protein (creA)
gene, complete cds. 


0.067 


<NONE> 


<NONE> 


<NONE> 


670 


M64983 


Human fibrinogen 
beta chain gene, 
complete mRNA. > 
gb|I47706|I47706 
Sequence 3 from 
patent US 5639940 


0.067 


<NONE> 


<NONE> 


<NONE> 


671 


AF014051 


Nicotiana tabacum 
Mg chelatase subunit 
(ChlH) mRNA, 
partial cds 


0.067 


<NONE> 


<NONE> 


<NONE> 


672 


Y07540 


H.sapiens sil gene 


0.067 


92331 


glycoprotein GP330, renal - rat 
(fragments) 


7.5 


673 


AJ000347 


Rattus norvegicus 
mRNA for 3'(2'),5'-
bisphosphate
nucleotidase 


0.067 


129238 


25 KD OOKINETE SURFACE
ANTIGEN PRECURSOR 
(PRS25) >gi|320962|pir||A44966
25k ookinete surface antigen 
precursor - Plasmodium 
reichenowi reichenowi]


7.4 


674 


L19979


Squid sodium channel 
mRNA, complete cds. 


0.067 


2128473 


hypothetical protein MJ0750 - 
Methanococcus jannaschii
>gi|1592304 (U67521)
ferredoxin-type protein


1.5 


675 


X08050 


Yeast tRNA-Glu(3) 
gene and flanking 
regions 


0.067 


1334398 


(X15081) MURF2 protein (AA
1-348) 


0.65 


676 


X17115 


Human mRNA for 
IgM heavy chain 
complete sequence 


0.067 


1731331 


^vPrVrwFTlf^Alf <[ 6 tfn 

il T rU 1 rl£ 1 ji.o tVL^ 

PROTEIN CY49.14C
>gi|1370241|gnl|PID|e247089
(Z73966) hypothetical protein 
Rv2075c [Mycobacterium 
tuberculosis] 


0.51 


677 


AF032871 


Homo sapiens 
uncoupling protein 3 
(UCP3) gene, exon 1
and partial exon 2 


0.067 


112900


ALPHA- 2C- J ADRENERGIC 

RECEPTOR human >gi|178194
(J03853) kidney alpha-2-
adrenergic receptor [Homo
sapiens] >gi|l62863S (U72648)
alpha2-C4-adrenergic receptor 
[Homo sapiens] 


0.50


678


X05319


Mouse class II MHC
E-beta 2 (d) gene
exon 3


0.067


585074


DYNAMIN 3 (DYNAMIN,
TESTICULAR)
>gi|391872|gnl|PID|d1003668
(D14076) testicular dynamin
[Rattus norvegicus]


3e-04 


679 


AB006362 


Candida albicans
CaSLNl gene, 
complete cds 


0.067 


3417296 


(AC003007) Unknown gene 
product (partial) [Homo sapiens]


9e-56 


680 


AF021236


African horse 
sickness virus capsid 
VP3 (L3) mRNA,
complete cds 


0.066 


<NONE> 


<NONE> 


<NONE> 


681 


AE001507


Helicobacter pylori, 
strain J99 section 68 
of 132 of the 
complete genome 


0.066


<NONE>


<NONE>


<NONE>


682 


AF039717 


Caenorhabditis 
elegans cosmid 
R13H8 


0.066 


<NONE> 


<NONE> 


<NONE> 


683 


AF029027 


Syncerus caffer 
isolate Queen 
Elizabeth Mweya 14 
mitochondrial DNA 
control region


0.066 


<NONE> 


<NONE> 


<NONE> 


684 


AF087967 


Homo sapiens full 
length insert cDNA 
clone YU51G05


0.066 


2982476 


(X97203) C1 protein [Beet curly
top virus] 


9.5 


685 


J02037 


Baboon endogenous 
virus proviral long 
terminal repeat DNA. 


0.066 


972767 


(L37868) POU-domain
transcription factor [Homo 
sapiens] 


13 


686 


AF000141 


Lycopersicon
esculentum class I
knotted-like
homeodomain protein
(LeT6) mRNA, 
complete cds 


0.066 


3157926 


(AC002131) Strong similarity to
extensin-like protein gb|Z34465
from Zea mays. [Arabidopsis 
thaliana] 


5.6 


687 


AB001746 


Bensingtonia sp.
OK255 gene for 18S
rRNA > ::
dbj|AB001747|AB001747
Bensingtonia
sp. OK259 gene for
18S rRNA


0.066 


3859889 


(AF070064) cap 'n collar
isoform C [Drosophila 
melanogaster] 


0.38


688


AE001461


Helicobacter pylori,
strain J99 section 22
of 132 of the
complete genome


0.065 


<NONE> 


<NONE> 


<NONE> 


689 


M30821


Chicken erythroid 
transport proteins cl 
and c2 


0.065 


<NONE> 


<NONE> 


<NONE> 


690 


AB009802 


Homo sapiens gene 
for osteonidogen, 
intron 3 


0.065 


<NONE> 


<NONE> 


<NONE> 


691 


AF086062 


Homo sapiens full
length insert cDNA
clone YZ06B11


0.065 


<NONE> 


<NONE> 


<NONE> 


692 


AB002369 


Human mRNA for 
KIAA0371 gene, 
complete cds 


0.065 


2500884 


SIGNAL SEQUENCE 
BINDING PROTEIN binding 
protein [Synechococcus sp.] 


5.5 


693 


AF086864 


Cyclopodia sp. large
subunit ribosomal
RNA gene,
mitochondrial gene
for mitochondrial
RNAs, partial
sequence > ::
gb|AF086866|AF086866
Penicillidia sp.
large subunit
ribosomal RNA gene,
mitochondrial gene
for mitochondrial
RNAs, partial
sequence


0.065 


3721684 


(AB012957) probable glycosyl 
transferase [Vibrio cholerae] 


5.5 


694 


L44593 


Bacteriophage BK5-T 

cds, 20 ORFs, 
repressor protein, and 
Cro repressor protein 
genes, complete cds, 
ORF70' gene, 5' end 
of cds. 


0.065 


1172067 


PEPTIDASE T 
(AMINOTRIPEPTIDASE)
influenzae Rd] 


3.2


695 


U80079


Ciona intestinalis 
MyoD-family protein
(CiMDFa) mRNA, 
complete cds 


0.065 


4218110 


(AL035353) contains EST 
j>b:F152Sl 


2.5 



696 


AB020718 


Homo sapiens mRNA 
for KIAA0911
protein, complete cds 


0.065 


1722734 


MINOR CAPSID PROTEIN L2
>gi|1020192 type 23]


1.9 


697 


AF082137 


Zea mays copia-like 
retrotransposon Stl- 
14 leader region, 
partial sequence 


0.065 


1877501 


(U89278) polyhomeotic 2 
homolog [Homo sapiens] 


U 


698 


X64053 


R.norvegicus ZnBP 
gene for zinc binding 
protein 


0.065 


464963 


TRYPSIN PRECURSOR 


0.36 


699 


U67065 


Mus musculus 
butyrophilin (BTN) 
gene, promoter region 
and complete cds 


0.065 


2132252 


hypothetical protein YPL263c - 
yeast 


3e-10


700 


M64862 


Rat matrin F/G 
mRNA, complete cds. 


0.065 


3420183 


(AF041105) organic anion
transporter protein 3 [Rattus 
norvegicus] 


4e-19


701 


K02205 


Yeast (S.cerevisiae) 
transcriptional 
activator of amino 
acid-biosynthetic 
genes (GCN4) gene, 
complete cds. 


0.064 


<NONE> 


<NONE> 


<NONE> 


702 


X58282 


Maize mRNA for a 
high mobility group 
protein 


0.064 


<NONE> 


<NONE> 


<NONE> 


703 


AC001545 


Homo sapiens 
(subclone 1_f3 from
P1 H69) DNA
sequence 


0.064 


<NONE> 


<NONE> 


<NONE> 


704 


AF023461 


Homo sapiens 
FRA3B region 
sequence 


0.064 


<NONE> 


<NONE> 


<NONE> 


705 


U50307 


Caenorhabditis 
elegans cosmid 
F43H9. 


0.064 


<NONE> 


<NONE> 


<NONE> 


706 


U46542 


Streptococcus crista 
-ImpA gene, partial 
cds, putative 
adhesin/ABC 
transport system 
protein (scbA) gene, 
complete cds 


0.064 


1209391 


(D83659) TPR protein pombe]
>gi|2894282|gnl|PID|e1251103
(AL02183S) pre-mrna splicing 
factor. [Schizosaccharomyces 
pombe] 


9.2 


707


X57564 


A.rusticana mRNA 
for neutral peroxidase 


0.064 


1492037 


(U60315) MC094R [Molluscum
contagiosum virus subtype 1] 


6.9 



708


U06986


Human alpha-2-
macroglobulin
receptor/lipoprotein
receptor protein
gene,
exons 39-41.


0.064 


100800 


rab15B protein - wheat
>gi|21853 (X6' ? 476) rab protein
[Triticum aestivum] 


5.3 


709 


D85773 


Human CpG island 
sequence, clone 
Q28B8 


0.064 


2245382 


(U88325) suppressor of 
cytokine signalling-1 [Mus
musculus] 


5.3 


710 


L06178 


Apis mellifera
ligustica complete
mitochondrial
genome


0.064 


3695379 


(Ar-UVoi /U) contains similarity 
to a C. elegans hypothetical
protein F44G4.1 (GB:Z49910)
and several yeast hypothetical
proteins such as 35.1 KD
protein in NAM8-GAR1
intergenic region (SP:P38805)
[Arabidopsis thaliana]


3.2 


711 


Y16242


Triticum aestivum
mRNA for beta-
amylase


0.064 


1175958 


HYPO'lHLllLAL ;0.3 lOJ 
PROTEIN IN AGP3-DAK3 
INTERGENIC REGION
>gi|1084712|pir||S56201
probable membrane protein
YFL054c - yeast
(Saccharomyces cerevisiae)
>gi|836701|gnl|PID|d1009825
(D50617) YFL054C 


3.1 


712 


L81779 


(subclone 2_a2 from
P1 H25) DNA
sequence 


0.064 


3845169 


(AE001391) phosphatase (acid 
phosphatase family) 


0.81 


713 


X13826


C.reinhardtii psb1
mRNA for OEE1 
protein of 
photosystem II 
(oxygen-evolving 
enhancer protein) 


0.064 


171040 


(M94535) ATPase 
[Saccharomyces cerevisiae] 
cerevisiae, Peptide, 377 aa] 
[Saccharomyces cerevisiae]


0.054 


714 


X06487 


H.sapiens mRNA for 
bcl2-Ig fusion gene


0.064 


2429362 


(AF020261) proline rich protein
[Santalum album] 


0.016 


715 


U7963S 


Mus musculus cyclin- 
dependent kinase 
inhibitor protein 
(p15(INK4b)) gene,
exon 2 and partial cds 


0.064 


3929221


(AF082557) TRF1-interacting
ankyrin-related ADP-ribose
polymerase [Homo sapiens]


1e-10


716


U39099


Human T cell
receptor alpha chain
mRNA, partial cds


0.063 


<NONE> 


<NONE> 


<NONE> 


717 


U39673 


Clostridium 
acetobutylicum KdpC 
(kdpC) gene, partial 
cds, sensor histidine
kinase homolog 
(kdpD) and response 
regulator homolog 
(kdpE) genes, 
complete cds 


0.063 


<NONE> 


<NONE> 


<NONE> 


718 


AL022317 


Human DNA
sequence from clone
140L1 on
chromosome 22q13.1-
13.31, complete
sequence [Homo
sapiens]


0.063 


1931640 


(U95973) Serine 
carboxypeptidase isolog
[Arabidopsis thaliana] 


5.2 


719 


U28972 


Spiroplasma citri orfa 
and orff genes, partial 
cds, orfb, orfc, and
orfe genes and
Spiroplasma virus
SpV1-derived ORF1
and ORF3 genes,
complete cds, and
SpV1-derived ORF14
gene, partial cds. 


0.063 


4091939 


(AF070704) envelope 
glycoprotein [Human 
immunodeficiency virus type 1]


5.2 


720 


U15159 


Mus musculus limk 
kinase (limk) mRNA, 
complete cds 


0.063 


3638957 


(AC004877) sco-spondin-mucin-
like; similar to P98167 uncertain
[Homo sapiens]


5.1 


721 


AF058416 


Homo sapiens 
lipoprotein receptor- 
related protein 
(LRP1), exons 39, 40, 
and 41 


0.063 


1788123 


(AE000276) orf, hypothetical 
protein [Escherichia coli] 


4.0 


722 


AE001430 


Plasmodium 
falciparum 
chromosome 2, 
section 67 of 73 of 
the complete 
sequence 


0.063 


2244349 


(Z97337) hypothetical protein 


4.0


723


L29323


Streptococcus
pneumoniae methyl
transferase gene
cluster, complete
sequence


0.063


3874022


(Z70203) cDNA EST
EMBL:D72339 comes from this
gene; cDNA EST
EMBL:D75197 comes from this
gene [Caenorhabditis elegans]


2.3 


724 


X72631 


H. sapiens mRNA 
encoding Rev- 
ErbAalpha > :: 
emb|X72632|HSREV 
ERB2 H.sapiens 
mRNA encoding Rev- 
ErbAalpha (internal 
fragment) 


0.063 


3979878 


(/./ jitO) predicted using 
Genefinder; cDNA EST
EMBL:T01277 comes from this
gene; cDNA EST
EMBL:T01796 comes from this
gene; cDNA EST
EMBL:D32545 comes from this
gene; cDNA EST
EMBL:D33060 comes from this
gene; cDNA EST EMBL:D...


1.7 


725 


U17969


Human initiation 
factor eIF-5A gene, 
complete cds. 


0.063 


2429509 


(AF025467) contains similarity 
to drosophila DNA-binding 
protein K10(NID:g8148) 
[Caenorhabditis elegans] 


1.4 


726 


AE001000


Archaeoglobus 
fulgidus section 107
of 172 of the 
complete genome 


0.063 


3462802 


(AF082486) nef protein [Human
immunodeficiency virus type 1] 


0.35 


727 


S80986 


svp[40]=svp-related 
nuclear
receptor/retinoid
signaling modulator
[zebrafishes, mRNA,
3876 nt] 


0.063 


1326288 


(U58734) weak similarity to 
ankyrin G [Caenorhabditis 
elegans]


0.093 


728 


AF109134 


Homo sapiens 7-60 
mRNA, complete cds


0.063 


1083764


proline-rich proteoglycan 2 
precursor, parotid - rat 
>gi|310200 (L17318) proline- 
rich proteoglycan [Rattus 
norvegicus] 


0.001 


729 


D87466 


Human mRNA for
KIAA0276 gene, 
partial cds 


0.063 


2879865 


unknown, len:251aa 
[Schizosaccharomyces pombe]


6e-05 


730 


AB018269


Homo sapiens mRNA 
for KIAA0726
protein, complete cds 


0.063 


2995865 


(AF053455) tetraspan TM4SF 
[Homo sapiens] 


2e-16 


731 


D86954


Cricetulus griseus 
mRNA for 
Cytochrome P-450 
2A14, complete cds 


0.063 


2496S96 


HYPOTHETICAL 47.6 KD 
PROTEIN C16C10.5 IN
CHROMOSOME III 
>gi|3S743S3|gnl|PID|e 1344077 

type (RING finger) 
[Caenorhabditis elegans] 


1e-22


732


AL010232


Plasmodium
falciparum DNA ***
SEQUENCING IN
PROGRESS ***
from contig 4-58,
complete sequence


0.062 


<NONE> 


<NONE> 


<NONE> 


733 


U90714 


Mycoplasma 
gallisepticum 
haemagglutinin
precursor genes, 
complete cds 


0.062 


<NONE> 


<NONE> 


<NONE> 


734 


AF107044


Homo sapiens clone 
pCL4 DNA-binding 
protein SOX21 
(SOX21) gene,
complete cds 


0.062 


<NONE> 


<NONE> 


<NONE> 


735 


L41729 


Caenorhabditis 
elegans Ro 
ribonucleoprotein 
autoantigen mRNA, 
complete cds 


0.062 


2983060 


(AE000687) putative protein 
[Aquifex aeolicus]


8.6 


736 


Z99287 


Caenorhabditis 
elegans cosmid
Y7A9D, complete 
sequence 
[Caenorhabditis 
elegans] 


0.062 


1176542


PUTATIVE
SERINE/THREONINE-
PROTEIN KINASE D1044.3
IN CHROMOSOME III
>gi|495684 (U00065) contains 
EGF-like repeats; highly similar 
to ZC84.1; 3' exons similar to 
protein kinase [Caenorhabditis 
elegans] 


5.8 


737 


AB014514


Homo sapiens mRNA 
for KIAA0614
protein, partial cds 


0.062 


4033395 


DNA GYRASE SUBUNIT B
subunit [Myxococcus xanthus] 


3.9 


738 


L29165 


Human germline 
immunoglobulin light 
chain variable region 
(lambda-IIIb
subgroup) from IgM 
rheumatoid factor. 


0.062 


1914685 


(Y12014) RAD23 protein, 
isoform II 


1.3 


739 


U09364 


Schistosoma 
japonicum Chinese 
clone pY6 
paramyosin mRNA, 
partial cds. 


0.062 


1350800 


MITOCHONDRIAL 
RIBOSOMAL PROTEIN S5


1.3 


740 


Y16242


Triticum aestivum
mRNA for beta- 
amylase 


0.062 


79834 


hypothetical protein 1246 (uvrA 
region) - Micrococcus luteus 
(fragment) 


0.59


741


M97695


Leishmania pifanoi
cysteine proteinase
(cys2) gene, complete
cds.


0.062


1174754


TROPOMYOSIN 1 (TMI)
(POLYPEPTIDE 49)
>gi|320989|pir||A60607
tropomyosin - fluke


0.018 


742 


U67526 


Methanococcus 
jannaschii section 68 
of 150 of the 
complete genome 


0.062 


1330345 


(uSbo:)) coded for by C.
elegans cDNA yk34b1.5; coded
for by C. elegans cDNA
yk13h10.5; coded for by C.
elegans cDNA yk46e8.5; coded
for by C. elegans cDNA
yk46d5.5; coded for by C.
elegans cDNA yk43c2.5; coded
for by C. elegans cDNA
yk46e8....


1e-40


743 


Z73414 


Caenorhabditis
elegans cosmid
W09D12, complete
sequence 
[Caenorhabditis 
elegans] 


0.061 


<NONE> 


<NONE> 


<NONE> 


744 


Y13606


Mus musculus gene
encoding filensin, 
exons 6, 7 


0.061 


2314715 


(AE000651) H. pylori predicted 
coding region HP1527


4.9 


745 


J04374 


Eggplant mosaic 
virus genome.


0.061 


141449 


PROTEIN IN TRANSPOSON 
TN4556 >gi|80759|pir||JQ0431 
hypothetical 35.5K protein - 
Streptomyces fradiae transposon 
Tn4556 


3.8 


746 


AB022200 


Marine obligately
oligotrophic
bacterium POO-10 
DNA for 16S 
ribosomal RNA, 
partial sequence 


0.061 


3983593 


(AB000307) transcarboxylase-
beta


2.2 


747 


X54250 


Rat mRNA for zinc 
finger protein AT-
BP2, partial cds


0.061 


1377886 


(L46815) DNA binding protein
Rc [Mus musculus]


0.9S 


748


X69942 


M.musculus mRNA
of enhancer-trap-
locus 1


0.061 


2983969 


(AE00074S) putative protein
[Aquifex aeolicus]


0.57 


749 


AJ223206 


Mus musculus mRNA
for scrapie responsive
protein 1 


0.061 


4204265 


(AC005223) 45643
[Arabidopsis thaliana]


5e-31 


750 


Y10205


H.sapiens mRNA for
CD83 protein 


0.060


<NONE>


<NONE>


<NONE>


751 


U79260 


Human clone 23745 
mRNA, complete cds


0.060 


<NONE> 


<NONE> 


<NONE> 


752 


X07453 


Plasmodium 
falciparum 11-1 gene 
part I 


0.060 


<NONE> 


<NONE> 


<NONE> 


753 


U57502 


Rattus norvegicus 
protein tyrosine 
phosphatase delta 
gene, catalytic 
domain, partial cds. 


0.060 


3452285 


(AF044915) polar tube protein
PTP55 precursor 


0.28 


754 


X68359 


M.fascicularis gene
for apolipoprotein C-
III


0.060 


730843 


SHUTTLE CRAFT PROTEIN
>gi|487400


2e-04 


755 


X51634 


Pseudomonas braB 
gene for branched 
chain amino acid 

transport carrier (LIV- 
H) 


0.059 


1835622 


(U857L8) CCML [Pseudomonas 
putida GB-1] 


8.1 


756 


AF072405


Gossypium hirsutum 
cotton fiber expressed
protein 2 (CFE2)
mRNA, complete cds


0.059 


423766 


alkaline phosphatase, 145K - 
Synechococcus sp. 


4.7 


757 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.056 


2662481 


(AF034859) juvenile hormone 
resistance protein 


3.3 


758 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.054 


547847 


LECTIN PRECURSOR 


7.0 


759 


X61046 


Hydra N-COL 2 
mRNA for mini- 
collagen, partial cds 


0.053 


<NONE> 


<NONE> 


<NONE> 


760 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme 


0.052 


<NONE> 


<NONE> 


<NONE> 


761 


S79843 


{random amplified 
hybridization 
microsatellite 
RAHM) [Beta 
vulgaris=sugar beets,
Genomic, 537 nt] 


0.025 


1730145 


GAMETOGENESIS 
EXPRESSED PROTEIN GEG-
154 >gi|2137331|pir||I48361
gene GEG-154 protein - mouse
>gi|550123 (X71642)
pid:s550123 [Mus musculus]


2e-16


762


AB000096


Mouse mRNA for
GATA-2 protein,
complete cds


0.023


<NONE>


<NONE>


<NONE> 


763 


Z62366 


H.sapiens CpG DNA,
clone 67h7, forward
read cpg67h7.ft1a.


0.023 


3123312 


ZINC FINGER PROTEIN 142 
\SiPi/\KJZjO) to Human zinc
finger protein (ZNF142) [Homo
sapiens]


5.9 


764 


L11670


Human
transmembrane
glycoprotein (CD53) 
gene, exons 2 through 
8. 


0.023 


80636


hypothetical o/ft. protein -
Mycobacterium fortuitum
plasmid pAL5000 >gi|149986
(M60875) ORF2 


3.4


765


D83984


Sulculus diversicolor 
DNA for IDO-like 
myoglobin, complete 
cds 


0.023 


3114665 


(AF061267) inner membrane 
component HtxE [Pseudomonas 
stutzeri] 


3.4 


766 


X98890 


S. tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPTl 


0.023 


683532 


(X02155) thyroglobulin [Bos 
taurus]


1.1 


767 


U58835 


Dissostichus mawsoni
preprotrypsin gene, 
complete cds 


0.022


<NONE>


<NONE> 


<NONE> 


768 


AJ009630 


Glomus versiforme 
chitin synthase gene 
(clone Gvchs3) 


0.022


<NONE>


<NONE>


<NONE>


769 


J04040 


Human glucagon 
mRNA, complete cds. 


0.022 


<NONE> 


<NONE> 


<NONE> 


770 


X74908 


L.escutenturn asf j 
gene 


0.022 


<NONE> 


<NONE> 


<NONE> 


771


L07293 


Shigella dysenteriae
O-antigen
polysaccharide
biosynthesis rfbX, O-
antigen polymerase
(rfc), rhamnosyl
transferase I and II
(rfbR and rfbQ) and 
rfbD genes, complete 
cds. 


0.022


<NONE>


<NONE>


<NONE>


772


AF040094


Mus musculus
inositol
polyphosphate 5-
phosphatase II
(INPP5P) mRNA,
complete cds


0.022 


<NONE> 


<NONE> 


<NONE> 


773 


X76776 


H.sapiens HLA-DMB 
gene 


0.022 


<NONE> 


<NONE> 


<NONE> 


774 


AE001521 


Helicobacter pylori, 
strain J99 section 82 
of 132 of the
complete genome 


0.022 


<NONE> 


<NONE> 


<NONE> 


775 


X16004


AJonga rbcL, rpl5, 
rps8, rp!36, rpsl4, 
rps2 t trnLtrrvF, tmC 
and rpoB (partial) 
genes > :: 

emb|X75651|ALRIBP 
A.longa plastid genes 
for ribosomal 
proteins, tRNAs, 
RNA polymerase 
subunit beta and 
rubisco large subunit 


0.022 


<NONE> 


<NONE> 


<NONE> 


776 


Y12707


Lactococcus lactis 
cremoris plasmid 
pHW393 DNA, 
rlladii, mlladii genes 


0.022 


<NONE> 


<NONE> 


<NONE> 


777 


U27118 


Arabidopsis thaliana 
glutamyl-tRNA
reductase 


0.022 


<NONE> 


<NONE> 


<NONE> 


778 


Z96622 


H.sapiens telomeric
DNA sequence, clone 
5PTEL002, read 
5FTELOO002.seq 


0.022 


191333 


(J05503) carbamoyl-phosphate
synthetase (E.C.6.3.5.5) 


9.8 


779 


D83984 


Sulculus diversicolor 
DNA for IDO-like 
myoglobin, complete 
cds 


0.022 


1078509 


probable membrane protein 
YDR018c - yeast


9.7 


780 


Z77952


H.sapiens flow-sorted
chromosome 6
HindIII fragment,
SC6pA4A3 


0.022 


4204206 


(AB022786) N-acetyl-beta-D- 
glucosaminidase [Enterobacter 
sp] 


7.5


781


M10217


Xenopus laevis
mitochondrial DNA,
complete genome.


0.022 


2145763 


B2168_C2_205 protein - 
Mycobacterium leprae 


7.3 


782


M55147 


Pea chloroplast 
glyceraldehyde-3- 
phosphate 
dehydrogenase 
(Gpbl) gene, 
complete cds. 


0.022 


417308 


PROBABLE HELICASE 
MOT1 Motlp is a probable 
helicase essential for vegetative 
growth on rich glucose medium 
at 30 degree C: Swiss-Prot 
Accession number P32333; 
similar to S. cerevisiae RAD26 
gene product: Swiss-Prot 
Accession number P40352 


4.2 


783 


X58839 


Acholeplasma virus
MV-L1 DNA for
complete circular 
genome 


0.022 


3273189 


(AB008757) subunit II of
c(o/b)3-type cytochrome c
oxidase [Bacillus
stearothermophilus]


4.1 


784 


M26185


Mouse c-myb 
oncogene, exon 1 and
exon 2 (partial). 


0.022 


138592 


VITELLOGENIN I 
PRECURSOR (YOLK 
PROTEIN I) 
>gi| III /U|pir|| VJrr 1 
vitellogenin I precursor 
unnamed protein product 
[Drosophila melanogaster]


2.5 


785 


AF061195


Streptomyces albus 
valine dehydrogenase 
(Vdh) gene, complete 
cds 


0.022 


2088768 


(AF003145) B0414.S gene
product [Caenorhabditis
elegans]


0.86 


786 


AF053622 


Homo sapiens alpha 
1,2-mannosidase IB 
gene, exon 9 


0.022 


1352361 


EARLY GROWTH
RESPONSE PROTEIN 1 fish
>gi|531456 (U12895) egr1
[Danio rerio] rerio]


0.36 


787 


Z71500


S.cerevisiae 
chromosome XIV
reading frame ORF 
YNL224c 


0.022 


1708875 


PUTATIVE TUMOR
SUPPRESSOR LUCA15
sapiens] 


0.16 


788 


D10471 


Herpes simplex virus 
type 2 genomic DNA 
for 0.74-0.84 region,
complete cds 


0.022 


3132276 


(AB0M4S6) short ORF [TT 
virus] 


0.13 


789 


U43082 


Zea mays T 
cytoplasm male 
sterility restorer 
factor 2 (rf2) mRNA,
complete cds 


0.022 


3319720 


(AL031035) putative aldehyde 
dehydrogenase [Streptomyces 
coelicolor] 


0.011


790


X86913


H.sapiens simple
tandem repeat DNA
(clone wg3a6)


0.021 


<NONE> 


<NONE> 


<NONE> 


791 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds 


0.021 


<NONE> 


<NONE> 


<NONE> 


792 


U340L6 


Nannostomus sp. 
large subunit rRNA 
gene, mitochondrial 
gene encoding 
mitochondrial rRNA, 
partial sequence. 


0.021


<NONE> 


<NONE> 


<NONE> 


793 


X00845 


Yeast mitochondrial 
genes for 15S rRNA 
and tRNA-Trp 


0.021 


<NONE> 


<NONE> 


<NONE> 


794 


AB012U3 


Homo sapiens gene 
for CC chemokine 
PARC precursor, 
complete cds 


0.021 


<NONE> 


<NONE> 


<NONE> 


795 


U62395 


Daucus carota 
globulin like protein 
(Gea8) gene, 
complete cds 


0.021 


<NONE> 


<NONE> 


<NONE> 


796 


M22718 


P.falciparum actin II 
gene, complete cds. 


0.021 


2623773 


(AF004835) tyrocidine 
synthetase 3 [Brevibacillus
brevis] 


8.8 


797 


U27118


Arabidopsis thaliana
glutamyl-tRNA
reductase


0.021 


3549885 


(AJ006631) cysteine-rich
secretory protein-1 [Equus
caballus] 


8.8 


798 


X99832 


H.sapiens CLN3
gene, complete CDS 


0.021 


262249 


(S52010) orf1 5' of EpoR [mice,
Peptide. 85 aa] [Mus sp.] 


8.7 


799 


AF016266


Homo sapiens TRAIL 
receptor 2 mRNA, 
complete cds 


0.021 


729048 


SUCCINYL-
COA:COENZYME A
TRANSFERASE transferase 
Clostridium kluyveri] 


8.7 


800 


Z92541 


Human DNA 
sequence from PAC 
179II5, BRCA2 gene 
region chromosome 
13q12-13 contains
lactase-phlorizin
hydrolase (LCT) 


0.021 


585820 


LIPOPOLYSACCHARIDE 1,2-
N-
ACETYLGLUCOSAMINETRANSFERASE >gi|466761
(U00039) rfaK [Escherichia
coli] >gi|1790053 (AE000440)
probably hexose transferase;
lipopolysaccharide core
biosynthesis


5.3


801


Cc o COO
JJOJOO


dopamine D2
receptor [human,
brain. Genomic, 3794
nt. segment 4 of 5]


0.021


2677620 


(Y08029) NAD(P)(+)~arginine 
ADP-ribosyltransferase 
[Oryctolagus cuniculus]


5.1 


802 




Rat nerve growth 
factor-inducible
protein (VGF) gene, 
complete cds. 


0.021


4103934


(AF030050) replication factor C
[Rattus norvegicus] 


3.1 


803 


AF045654


Gallus gallus
neuregulin beta-1a
mRNA, complete cds


0.021 


2746829 


(AF040647) No definition line 
found [Caenorhabditis elegans]


3.0 


804 


M69023 


Human globin gene.


0.021 


3880259 


(Z82056) T26H5.8
[Caenorhabditis elegans]
>gi|3880787|gnl|PID|e1350288
(AL032620) T26H5.8 


2.4 


805 


Z65960 


H. sapiens CpG DNA, 
clone 69d2, reverse 
read cpg69d2.rtlb . 


0.021 


1707245 


(U80845) similar to family 1 of 
G-protein coupled receptors 
[Caenorhabditis elegans]


0.79 


806 


X97073 


A.oligospora gene
encoding lectin


0.021 


116949 


CORE ANTIGEN 
>gi|73601|pir||NKVLC2 core 
antigen - woodchuck hepatitis 
virus 2 >gi|336135


0.47 


807 


X56491 


D. melanogaster 
mRNA for gene 
containing opa 
repetitive element 


0.021 


2842750 


HOMEOBOX PROTEIN DLX- 
7 >gi| 1620520 


0.16 


808 


L78760 


Homo sapiens 
(subclone l_f6 from 
r 1 H31) DNA 
sequence 


0.021 


113671 


!!!! ALU CLASS F WARNING 
ENTRY !!!!


0.15


809 


AB007864 


Homo sapiens 
KIAA0404 mRNA, 
partial cds 


0.021 


I1SI44 


CYSTEINE SYNTHASE A (O-
ACETYLSERINE
SULFHYDRYLASE A) (O-
ACETYLSERINE (THIOL)-
LYASE A) (CSASE A)
>gi|68323|pir||SYEBAC cysteine
synthase (EC 4.2.99.8) A - 
Salmonella typhimurium 
>gi|153935 (M21450) cysK
protein [Salmonella 
typhimurium] 


0.12 


810 


AL021932 


Mycobacterium 
tuberculosis H37Rv 
complete genome; 
segment 22/162


0.021 


2909514


(AL021932) hypothetical
protein Rv0439c 


7e-10


811 


U89991


Hypocrea jecorina
mannose-1-phosphate
guanylyltransferase
(MPG1) mRNA, 
complete cds 


0.021 


3581924 


(AL031538) mannose-1-
phosphate guanyltransferase
[Schizosaccharomyces pombe]


6e-20 


812 


X00641 


Sugar beet 
mitochondrial 
minicircle pO 
sequence 


0.020 


<NONE> 


<NONE> 


<NONE> 


813 


Z50097 


D.melanogaster 
mRNA for hdc 
protein. 


0.020 


<NONE> 


<NONE> 


<NONE>


814


AF044866


Phoebis sennae large
subunit ribosomal 
RNA gene, partial 
sequence; tRNA-Val 
gene, complete 
sequence; and small 
subunit ribosomal 
RNA gene, partial 
sequence, 

mitochondrial genes 
for mitochondrial 
RNAs 


0.020 


<NONE> 


<NONE> 


<NONE> 


815 


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds 


0.020 


<NONE> 


<NONE> 


<NONE> 


816 


AF027174 


Arabidopsis thaliana
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete
cds 


0.020 


<NONE> 


<NONE> 


<NONE> 


817 


AE001405 


Plasmodium 
falciparum 
chromosome 2, 
section 42 of 73 of 
the complete 
sequence 


0.020 


2196776 


(AF003342) bunched gene 
product [Drosophila 
melanogaster] 


8.4 


818 


AF074387 


Sambucus nigra
hevein-like protein 
mRNA, complete cds 


0.020 


627071 


histidine-rich protein -
Plasmodium lophurae 


2.S 



819 


Y13304 


Hylobates hoolock
mitochondrial DNA
for cytb gene. Horace


0.020 


285580 


(D10043) ORF [Acetobacter
pasteurianus]


2.1 


820 


Z66539 


H.sapiens creatine 
transporter gene 


0.020 


1703594 


(U8U4jy) coded for by C.
elegans cDNA yk7c8.5; coded
for by C. elegans cDNA
yk133b3.5; coded for by C.
elegans cDNA yk65a4.5; coded 
for by C. elegans cDNA 
yk7c8.3; coded for by C. 
elegans cDNA CEESQ66F; 
coded for by C. elegans cDNA 
yk65a4.3;... 


0.98 


821 


AF053622 


Homo sapiens alpha 
1,2-mannosidase IB 
gene, exon 9


0.020 


1352361 


EARLY GROWTH 
RESPONSE PROTEIN 1 fish 
>gi|531456 (U12895) egr1
[Danio rerio] rerio]


0.72 


822 


M20555 


Human MHC class II 
HLA-DRw53-beta 
(DR4,w4) gene, 
exons 2,3,4,5,6.


0.020 


465569 


HYPOTHETICAL 38.1 KD
PROTEIN IN SBCB-HISL
INTERGENIC REGION
>gi|405956 (U00009) 
ORFJD:o349#4; similar to 
[SwissProt Accession Number 
P33015] [Escherichia coli] 
>gi|1736693|gnl|PID|dl0I6570 
Number P33015] [Escherichia 
coli] >gi| 1788323 (AE000292) 
putative transport system 
permease protein [Escherichia 
coli] 


0.43 


823 


M20555 


Human MHC class II 
HLA-DRw53-beta 
(DR4,w4) gene, 
exons 2,3,4,5,6. 


0.020 


1709751 


COENZYME PQQ 
SYNTHESIS PROTEIN F 
synthesis F - Pseudomonas 
fluorescens >gi|929802


0.42 


824


AJ005015


Homo sapiens mRNA
for putative SMC-like
protein, partial


0.020


267449


HYPOTHETICAL [illegible] KD
PROTEIN ZK637.2 IN
CHROMOSOME III 
>gi|102507|pir||S15787 
hypothetical protein 1 (cosmid 
ZK637) - Caenorhabditis 
elegans Genefinder; cDNA EST 
yk217b5.3 comes from this 
gene; cDNA EST yk217b5.5
comes from this gene; cDNA
EST yk340g12.3 comes from
this gene; cDNA EST
yk340g12.5 comes from this
gene; cDNA EST yk428c5.5
co...


1e-12


825 


AF034099 


Laccaria bicolor 
glyoxal malate 
synthase protein 
mRNA, complete cds 


0.020 


1109847 


(U41538) No definition line 
found [Caenorhabditis elegans] 


1e-22


826 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.019 


132836 


60S RIBOSOMAL PROTEIN
L28 protein L28 [Rattus 
norvegicus] 


5.7 


827 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.019 


2633401 


(Z99109) similar to DNA 
exonuclease


4.5 


828 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.019 


2492604 


MULTIDRUG RESISTANCE 
PROTEIN CDR2 albicans] 


4.4 


829 


U67538 


Methanococcus 
jannaschii section 80 
of 150 of the 
complete genome 


0.019 


1723566 


PUTATIVE
GLUCOSYLTRANSFERASE
C17C9.07
>gi|1314159|gnl|PID|e241760
(Z73099) SPAC17C9.07,
putative glucosyltransferase len:
501, similar to
SW:ALG8_YEAST P40351
glucosyltransferase ALG8
pombe]


2.7 


830 


U56088 


Human periodic 
tryptophan protein 2 
(PWP2) gene, exons 
3 to 14 


0.019 


2144804 


collagen alpha 1(II) chain -
bovine


0.040 


831 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.018 


1916976 


(U91682) vitelline membrane
protein homolog [Aedes
aegypti]


7.2 


832 


AF026258 


Onobrychis viciifolia 
chalcone synthase 
(CHS) mRNA, 
complete cds 


0.018 


763076 


(Z48799) ZP3 [Cyprinus carpio] 
>gi|777724 (L41637) egg 
membrane protein [Cyprinus 
carpio] 


5.2 


833 


U95094 


Xenopus laevis XL- 
INCENP(XL- 
INCENP) mRNA, 
complete cds 


0.009 


3955011 


(AJ005438) beta adrenoreceptor 
B 


0.60 


834 


X7J603 


C.jejuni VS1 DNA >
emb|A39603|A39603
Sequence 2 from
Patent WO9417205 >
:: gb|I76090|I76090
Sequence 2 from
patent US 5691138


0.008 


<NONE> 


<NONE> 


<NONE> 


835 


AF093268 


Rattus norvegicus
homer-1c mRNA,
complete cds 


0.008 


138116 


HEAD FIBER PROTEIN 
(LATE PROTEIN GP8.5) 
>gi|75846|pir||WMBPSH gene 
8.5 protein - phage PZA 
>gi|216057 (M11813) head 
fiber protein 


8.1 


836 


X91751 


Bovine herpesvirus 
type 1 LTL7 gene


0.008 


1711436 


SUPEROXIDE DISMUTASE 
(FE) 1.15.1.1) (Fe)- 
Pseudomonas aeruginosa 
>gi|409767 


5.9 


837 


M95594 


Arabidopsis thaliana
1-aminocyclopropane-
1-carboxylate
synthase (ACS2) 
gene, complete cds. 


0.008 


683698 


(Z48229) orfl gene product 
[Saccharomyces cerevisiae] 


1e-06


838 


U67465 


Methanococcus 
jannaschii section 7 
of 150 of the 
complete genome 


0.008 


3874664 


(Z68493) predicted using 
Genefinder 


1e-07


839 


X72388 


B.taurus mRNA for 
filensin


0.008 


100174 


1-aminocyclopropane-1-
carboxylate synthase 


7e-09 


840 


U22398 


Human Cdk-inhibitor 
p57KIP2 (KIP2)
mRNA, complete cds. 


0.008


2228750 


(U93868) RNA polymerase III 
subunit [Homo sapiens] 


2e-lS 


841 


L42546 


Xenopus laevis LIM 
class homeodomain 
protein 


0.007 


<NONE> 


<NONE> 


<NONE> 



842


AF041428


Homo sapiens
ribosomal protein s4
X isoform gene,
complete cds


0.007 


<NONE> 


<NONE> 


<NONE> 


843 


AF000227 


Secale cereale omega 
secalin gene, 
complete cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


844 


D86254 


Human MHC (HLA) 
DRB intron 1 DNA, 
partial sequence 


0.007


<NONE> 


<NONE> 


<NONE> 


845 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.007 


<NONE> 


<NONE> 


<NONE> 


846 


Y07738 


M.musculus gene for 
vimentin 


0.007 


<NONE> 


<NONE> 


<NONE> 


847 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme 


0.007 


<NONE> 


<NONE> 


<NONE> 


848 


AF055119 


Homo sapiens alpha- 
tectorin (TECTA) 
gene, exon 6 


0.007 


<NONE> 


<NONE> 


<NONE> 


849 


M61195 


Zucchini 1- 
aminocyclopropane- 1 
carboxylate synthase 


0.007 


<NONE> 


<NONE> 


<NONE> 


850 


Y11050


Homo sapiens DSG3 
gene, partial intron 
and partial exon 6, 
140 bp 


0.007 


<NONE> 


<NONE> 


<NONE> 


851 


X61204 


M.voltae vhuD, 
vhuG, vhuA, vhuU & 
vhuB genes 


0.007 


<NONE> 


<NONE> 


<NONE> 


852 


AB012105 


Brassica rapa mRNA 
for SLG45, complete 
cds 


0.007 


<NONE> 


<NONE> 


<NONE> 


853 


S43SS2 


telomere;
(minichromosome,
repeats)
[Trypanosoma brucei,
Genomic, 1170 nt]


0.007 


<NONE> 


<NONE> 


<NONE>


854 


L32674 


Geomydoecus nadleri 
mitochondrial 
cytochrome oxidase I 
gene, partial cds. 


0.007 


<NONE> 


<NONE> 


<NONE> 


855 


U58732 


Caenorhabditis 
elegans cosmid 
F48D6. 


0.007 


<NONE> 


<NONE> 


<NONE> 


856 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.007


<NONE> 


<NONE> 


<NONE> 


857 


Z35284 


H.sapiens mRNA for
MDR3 P- 
glycoprotein 


0.007 


1730696 


HYPOTHETICAL 121.1 KD 
PROTEIN IN BIO3-HXT17
INTERGENIC REGION 
PRECURSOR YNR067c - yeast 
(Saccharomyces cerevisiae) 


9.5 


858 


X15217 


Human sno oncogene 
mRNA for sno A 
protein, ski-related 


0.007 


902455 


(U24203) membrane protein 
[Escherichia coli]


8.8 


859 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


0.007 


1684636 


(Y09454) ORF3 [Lactobacillus
casei bacteriophage A2]


8.3 


860 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.007 


3878803 


(Z48795) R05H5.7
[Caenorhabditis elegans]


8.3 


861 


S76317 


Tiy=l!JU-2UU kda 
membrane protein 
scavenger receptor 
homolog {clone 18. 
intron and flanking 
exons 14 and 15} 
[sheep, lymph node, 
lymphocytes. 
Genomic, 30S nt, 
segment 2 of 2]


0.007 


294747 


(L08174) ORF2 
[Romanomermis culicivorax] 


7.4 



862 


D88084 


Pedicularis 
verticillata
chloroplast DNA,
intergenic region
between trnT(UGU)
and trnL(UAA) 5' exon


0.007 


2555187 


(AF026789) vitellogenin 
[Pimpla nipponica]


6.9 


863 


X58869 


Chicken mRNA for
aldehyde
dehydrogenase


0.007 


115978 


CD30L RECEPTOR 
PRECURSOR 
(LYMPHOCYTE 
ACTIVATION ANTIGEN 


6.5 


864 


D87120 


Homo sapiens mRNA 
for GS3786, complete 
cds 


0.007 


3879589 


{z-juo / j j i lumii nui utniiairr, 
cDNA EST EMBL:D35637
comes from this gene; cDNA
EST yk322a3.5 comes from this
gene; cDNA EST yk397b2.5
comes from this gene; cDNA
EST yk348b11.5 comes from
this gene; cDNA EST
yk397b2.3 comes fr...
>gi|3880965|gnl|PID|e1350578
comes from this gene; cDNA
EST yk322a3.5 comes from this
gene; cDNA EST yk397b2.5
comes from this gene; cDNA
EST yk348b11.5 comes from
this gene; cDNA EST
yk397b2.3 comes ...


5.1 


865 


X68793 


H. sapiens gene for
antithrombin III


0.007


i. _> J O J. \J~J


(AF010403) ALR [Homo
sapiens]




866 


AJ001596 


Danio rerio mRNA 
for opioid receptor 
homologue 


0.007 


2507509 


HYPOTHETICAL 2¥.i(KD 
PROTEIN IN HOLB-PTSG 
INTERGENIC REGION 
>gi|1787342 (AE000210) orf,
hypothetical protein
Escherichia coli] protein in
holB 3' region [Escherichia
coli]


1.9 


867 


AF061195


Streptomyces albus 
valine dehydrogenase 
(Vdh) gene, complete 
cds 


0.007 


2088768


(AF003145) B0414.8 gene
product [Caenorhabditis 
elegans] 


1.9 


868 


AJ005813


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage
enzyme 


0.007 


1710105 


UDP-N-
ACETYLGLUCOSAMINE 2-
EPIMERASE UDP-N-
acetylglucosamine 2-epimerase
Plasmid pWQ799]


1.7 



869


L03398


Zebrafish retinoic
acid receptor alpha
2.A


0.007 


2239219 


(Z97210) hypothetical protein 


0.77 


870 


D63484


Human mRNA for 
KIAA0150 gene, 
partial cds 


0.007 


19917 


(Z14014) Pistil extensin like
protein, partial CDS only 


0.61


871


M31483


Maize glyceraldehyde 
3-phosphate
dehydrogenase, 3'
end. 


0.007 


543068 


mucin, tracheobronchial - dog 
>gi|402558 


0.45 


872 


AF090115


Lycopersicon 
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds 


0.007 


2494941


ALPHA-2B ADRENERGIC 
RECEPTOR adrenoceptor 
[Cavia porcellus] 
>gi|1587159|prf||2206293B
adrenoceptor alpha2B [Cavia 
porcellus] 


0.42 


873 


AF064029 


Helianthus tuberosus 
lectin I mRNA, 
complete cds 


0.007 


1110587 


(S79410) nuclear localization
signals Peptide, 140 aa] [Mus
sp.]


0.26 


874 


XS8931 


H. sapiens PAL 2 A 
gene


0.007 


1706176 


CUTINASE TRANSCRIPTION 
FACTOR 1 ALPHA 
>gi|1262912 (U51671) cutinase
transcription factor 1 [Fusarium 
solani f sp. pisi] 


0.21


875 




zRAR alpha =retinoic 
acid receptor alpha 
[zebrafish, embryos, 

ml5 W A 1 111 nrl 

mKiNA, i / / j nij 


0.007




(Z97210) hypothetical protein


0.11


876 


M74193 


Petromyzon marinus 
plasma albumin 
mRNA, complete cds. 


0.007 


730888 


OCTAPEPTIDE-REPEAT
PROTEIN T2 


0.011


877 


U03673 


Saccharomyces 
cerevisiae Spp41p
(SPP41) gene, 
complete cds. 


0.007 


3820885 


(AL033126) 65G3.k 
[Drosophila melanogaster]


0.001 


878 


D37766


Homo sapiens mRNA
for Laminin-5 beta3 
chain, complete cds 


0.007 


1235974 


(X96713) collagen [Globodera
pallida]


3e-06



879



AF022388



Caenorhabditis
elegans putative
transcription factor
MAB-3 (mab-3)
gene, complete cds



0.007 



3747107 



(AF095741) unknown [Rattus 
norvegicus]



5e-09 



880 



U89984 



Acanthamoeba
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds 



0.007 



1890281 



(U89984) transformation- 
sensitive protein homolog



mi iu t iVjuuun. 
rabGAP domains; cDNA EST 
EMBL:D34945 comes from this 
gene; cDNA EST 
EMBL:D27313 comes from this 
gene; cDNA EST 
EMBL:D34829 comes from this 
gene; cDNA EST 
EMBL:D27312 comes from this
gene; cDNA ... Probable 
rabGAP domains; cDNA EST 
EMBL:D34945 comes from this 
gene; cDNA EST 
EMBL:D27313 comes from this 
gene; cDNA EST 
EMBL:D34829 comes from this 
gene; cDNA EST 
EMBL:D27312 comes from this
gene; cDNA ... 



2e-09 



881 



AB020689 



Homo sapiens mRNA 
for KIAA0882
protein, partial cds 



0.007 



3880809 



1e-23



882 



AF100694



Mus musculus
Pontin52 mRNA, 
complete cds 



0.006 



<NONE> 



<NONE> 



<NONE> 



883 



AF027173 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete 
cds 



0.006 



<NONE> 



<NONE> 



<NONE> 



884 



U76524 



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 



0.006 



<NONE> 



<NONE> 



<NONE>


885


U76524


Sambucus nigra
ribosome inactivating
protein precursor
mRNA, complete cds


0.006 


<NONE> 


<NONE> 


<NONE> 


886


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


0.006


<NONE>


<NONE>


<NONE>


887 


AB012106


Brassica rapa mRNA 
for SRK45, complete 
cds 










888 


MoUj2V 


Rattus norvegicus 
ceruloplasmin gene, 
exon l ano j uann 


0.006








889 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete 
cds 


0.006 


99408 


hypothetical protein 6 - 
Chlamydomonas reinhardtii 

U UUjUvjvI 1 

>gi| 1 3607 17|gnl|PID|e3346 1 
reinhardtii] 


9.6 


890 


U76523 


jdUiuUCub niHiu icvuu 
precursor mRNA,
complete cds 


0.006 


4039024 


(AF039110) polyprotein
[Rubella virus] 


9.3 


891 




Rattus norvegicus 
homer-1c mRNA,
complete cds




1 UUJ J J 


(M9442S) merozoite surface
antigen 1 [Plasmodium vivax]


7.5


892


AB012106


Brassica rapa mRNA
for SRK45, complete
cds 


0.006 


4019458 


(AF093984) envelope
glycoprotein [Human
immunodeficiency virus type 1]


7.0 


893 


AJ005813 


Arabidopsis thaliana
mRNA for 
neoxanthin cleavage 
enzyme 


0.006 


1916976 


(U91682) vitelline membrane
protein homolog [Aedes 
aegypti] 


6.S 


894 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.006 


102059 


promastigote surface antigen-2 
(clone 4.6) - Leishmania major 
(fragment) >gi|9583 (X57135)
surface antigen P2 [Leishmania 
major] 


2.4 


895 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.006 


3171241 


(AF067204) transcription factor 
BF-1 [Danio rerio]


1.0 


896


X993S4 


M.musculus mRNA 
for paladin gene


0.003 


<NONE> 


<NONE> 


<NONE>


897 


AF027174 


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
B) mRNA, complete 
cds 


0.003 


<NONE> 


<NONE> 


<NONE> 


898


AE001148


Borrelia burgdorferi
(section 34 of 70) of
the complete genome


0.003 


4160388 


(AJ011856) ORF Q0255
[Saccharomyces cerevisiae] 


7.6 


899 


AF027173 


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete
cds 


0.003 


1709213 


NUCLEAR ENVELOPE PORE 
MEMBRANE PROTEIN POM 
121 (PORE MEMBRANE 
PROTEIN OF 121 KD) (P145) 


1.5 


900 


U72396 


Lycopersicon
esculentum class II
small heat shock
protein Le-HSP17.6
mRNA, complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


901 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


902 


AF104631


Chlamydomonas 
reinhardtii light 
harvesting complex II 
protein precursor 
(Lhcb3) mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


903 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


0.002 


<NONE> 


<NONE> 


<NONE> 


904 


AB012106 


Brassica rapa mRNA
for SRK45, complete
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


905 


M21339


Human non-histone 
chromosomal protein 
HMG-14 gene,
complete cds. 


0.002 


<NONE> 


<NONE> 


<NONE> 


906 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


0.002 


<NONE> 


<NONE> 


<NONE>


907


X57103


Human h-lys gene for
lysozyme (upstream
region)


0.002 


<NONE> 


<NONE> 


<NONE> 


908 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


909 


U01066 


nuiiuii V- l/^t 

promoter, partial 
sequence. 


0.002 


<NONE> 


<NONE> 


<NONE> 


910 


L28094 


D. r | a .. mDM A 

oaney irovrvrv 
sequence- 


0.002 


<NONE> 


<NONE> 


<NONE> 


911 


AD000833 


Homo sapiens DNA 
from chromosome 19- 

kb EcoRI restriction 
fragment) 


0.002


<NONE> 


<NONE> 


<NONE> 


912 


AJ011701 


Homo sapiens TRHR 
gene promoter and 
exons 1-2, partial 


0.002 


<NONE> 


<NONE> 


<NONE>


913 


AF100694


Mus musculus
Pontin52 mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


914 




Homo sapiens retinol 
dehydrogenase gene, 
complete cds






<NONE> 


<NONE> 


915 




Rattus norvegicus 
homer-1c mRNA,
complete cds 






<NONE> 


<NONE> 


916 


U67608 


Methanococcus 
jannaschii section 150 
of 150 of the 
complete genome


0.002 


<NONE> 


<NONE> 


<NONE>


917 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


918 


Z46736 


H.sapiens DNA for 
repeat region (ABM- 
C82) 


0.002 


<NONE> 


<NONE> 


<NONE> 


919 


AB012106


Brassica rapa mRNA
for SRK45, complete
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 



920


Z85983


X.laevis mRNA for
NOVA protein


0.002 


<NONE> 


<NONE> 


<NONE> 


921


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


922


S61977 


medium-chain acyl- 
CoA dehydrogenase 
{exon 10, intron 10}
[human, Genomic,
1407 nt] 


0.002 


<NONE> 


<NONE> 


<NONE> 


923 


AJ005813


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage
enzyme 


0.002 


<NONE> 


<NONE> 


<NONE> 


924 


AB012105 


Brassica rapa mRNA 
for SLG45, complete
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


925 


AB012106 


Brassica rapa mRNA 
for SRK45, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


926 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete 
cds 


0.002 


<NONE> 


<NONE> 


<NONE> 


927 


X51646


H. sapiens DNA for 
dopamine D2 
receptor gene


0.002 


3329125 


(AE001337) YopC/Gen
Secretion Protein D [Chlamydia 
trachomatis] 


9.5 


928


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.002 


465762 


HYPOTHETICAL 112.1 KD
PROTEIN C06G4.1 IN
CHROMOSOME III 
>gi|630524|pir||S44748 
C06G4.1 protein - 
Caenorhabditis elegans 
>gi|409292 (L25598) homology 
with vigilin; coded for by C. 
elegans cDNA 

GenBank:MS8954 (CELI2C9); 
putative [Caenorhabditis 


8.9 


929 


U4847S 


Human skeletal 
muscle ryanodine 
receptor gene 


0.002 


2137221 


co-repressor protein - mouse 
>gi|642619


6.9 


930 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


0.002


806536 


(Z22520) membrane protein 
[Bacillus acidopullulyticus] 


6.3 


931 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds 


0.002 


3881055 


(AL023844) Y48A6B.1 
[Caenorhabditis elegans]


5.8 


932 


AF090115 


Lycopersicon
esculentum cytosolic
class II small heat
shock protein HCT2
(HSP17.4) mRNA,
complete cds


0.002 


3878330 


fV8 \Vff1\ K07A 1 4 
[Caenorhabditis elegans]


4.8 


933 




Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.002


1 J 1 0*HJ 


REPLICATION PROTEIN E1
papillomavirus




934 


AF019660


Mus musculus 
nuclear orphan 
receptor ROR gamma 


0.002 


1330365 


(U58757) similar to nucleotide 
pyrophosphatases 


3.9 


935 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


0.002 


1785972 


(U46951) ORF5; Method:
conceptual translation supplied 
by author 


3.7 


936


V00508 


Human gene for
epsilon-globin. 


0.002 


1333804 


[Ruminococcus flavefaciens] 


3.5 


937 


AB012105


Brassica rapa mRNA
for SLG45, complete
cds


0.002


41 


(AC005531) similar to mouse 
homeodomain-interacting 
protein kinase 2; similar to 
AF077659 (PID "370^958) 


3.0 


938 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme


0.002 


1070461 


ornithine carbamoyltransferase 
(EC 2.1.3.3) - yeast
(Saccharomyces cerevisiae)
>gi|929866 (X83502)
pid:e130025 [Saccharomyces
cerevisiae] >gi|1008256


2.8 


939 


S41458 


rod cGMP 
phosphodiesterase 
beta-subunit [human, 
mRNA, 3231 nt]


0.002 


3450S83 


(AF083334) fibroin [Antheraea
pernyi] 


1.6 



940


X06286


Drosophila
melanogaster Gart
locus with genes for
GARS=phosphoribosylamineglycine
ligase,
AIRS=phosphoribosylformylglycinamidine
cyclo-ligase,
GART=glycinamide
ribotide
transformylase > ::
gb|J02527|DROGART D.melanogaster
Gart gene encoding
two polypeptides with
GAR synthase, AIR
synthase, and GAR
transformylase
enzyme activities and
a pupal cuticle gene
nested within intron
A of the Gart gene.


0.002


2662054 


(AB004651) isocitrate lyase 


1.5 


941 


AF015812 


Homo sapiens RNA 
helicase p68 
(HUMP68) gene,
complete cds 


0.002 


3641659 


(AB008374) alpha 3 type I 
collagen 


1.1 


942 


X78925 


H. sapiens HZF2 
mRNA for zinc finger 
protein 


0.002 


141624 


ZINC FINGER PROTEIN ZFP- 
37 (MALE GERM CELL 
SPECIFIC ZINC FINGER 
PROTEIN) 


1.0 


943 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


0.002 


3879997 


(Z49071) weak similarity with 
mu-type opioid receptor (Swiss 
Prot accession number (P33535)


1.0 


944 


Z69639 


Human DNA 
sequence from 
cosmid L241B9,
Huntington's Disease
Region, chromosome
4p16.3 contains
polymorphic VNTR 
pYNZ32. 


0.002 


3523162 


(AF076292) TGF-beta/activin 
signal transducer FAST-lp 


0.81



m2> 
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Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















945 


AF074387


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


0.002 


2984161 


(AE000761) hypothetical
protein [Aquifex aeolicus] 


0.80 


946 


AF093268


Rattus norvegicus 
homer-1c mRNA,
complete cds 


0.002 


101830 


hypothetical protein B - chestnut 
blight fungus 


0.72 


947 


AF017307 


Homo sapiens Ets- 
related transcription 
factor (ERT) mRNA, 
complete cds 


0.002 


200531 


(M18071) prion protein [Mus
musculus] 


0.72 


948 


U11383


Drosophila 
melanogaster Ovo- 
1028aa (ovo) mRNA, 
complete cds. 


0.002 


2465207 


(AF016045) OVO-like 1 
binding protein [Homo sapiens] 


0.35 


949 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


0.002 


3834294 


(U80846) No definition line 
found [Caenorhabditis elegans]


0.29 


950 


AF086315


Homo sapiens full 
length insert cDNA 
clone ZD52F10 


0.002 


545067 


(S63336) action potential 
broadening potassium 
channel=Shab [Aplysia, bag cell 
neurons, head ganglia. Peptide, 
905 aa] [Aplysia] 
>gi|7431l0|prf||20H375A K 
channel [Aplysia californica] 


0.15 


951 


X53096 


S. aureus genes 
encoding Sau96I 
DNA 

methyltransferase and 
Sau96I restriction 
endonuclease 


0.002 


2529575 


(AF018164) kinesin-like protein
3C [Homo sapiens] 


0.11 


952 


AB012105


Brassica rapa mRNA 
for SLG45, complete
cds 


0.002 


729918 


LA PROTEIN HOMOLOG (LA 
RIBONUCLEOPROTEIN) (LA 
AUTOANTIGEN HOMOLOG) 


0.092 


953 


X73973 


G.gallus RAR- 
gamma2 mRNA for 
retinoic acid receptor 


0.002 


586122 


TRICHOHYALIN 
>gi|423321|pir||A40691 
trichohyalin - sheep >gi|29594I 
(Z1S36!) trichohyalin 


0.073 


954 


S41458


rod cGMP 
phosphodiesterase 
beta-subunit [human,
mRNA, 3231 nt] 


0.002 


1017427 


(X90569) elastic titin [Homo
sapiens] 


0.013 






WO 01/02568 



PCT/US00/18374 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






D.melanogaster






(U88169) similar to




955 




defective chorion-1
fc125 (dec-1) gene.


0.002




molybdoterin biosynthesis 
MOEB proteins [Caenorhabditis 


n oos 


956 


AF034099


Laccaria bicolor 
glyoxal malate 

mRNA, complete cds 


0.002 


1825593 


[Caenorhabditis elegans] 


1e-06


957 


AF033929 


Bactrocera dorsalis 
strain Tahiti

«J H Ulll -L Hill il 

mitochondrial D-loop 
region, complete 
sequence 


9e-04 


<NONE> 


<NONE> 


<NONE> 


958 


AB012106 


Brassica rapa mRNA

for SRK45, complete 
cds 


8e-04 


<NONE> 


<NONE>


<NONE> 


959 




Homo sapiens DEAD- 
box protein (BAT1) 

nana niftiAl 

gene, partial cds


8e-04








960 




Human ataxin-2 
related protein 


8e-04








961 


AF051709 


Dendrocopos 
leucopterus clone 2

repeat region 


8e-04 


<NONE> 


<NONE> 


<NONE> 


962 


X14077


Pea phy gene for 

phytochrome 

apoprotein 


8e-04 


<NONE> 


<NONE> 


<NONE>


963 


AC004497 


Homo sapiens 
chromosome 21, P1
clone LBNL#6 


8e-04 


457146 


(L27838) rhoptry protein
[Plasmodium yoelii]


9.6 


964 


AF077344 


Homo sapiens 
cartilage-derived C- 
type lectin 


8e-04 


3702123 


(AJ011707) TraD protein
[Escherichia coli] 


8.5 


965 


X85117


H. sapiens epb72 gene 
exons 2,3,4,5,6,7


8e-04 


2570059 


(AJ004687) N-4 cytosine-
specific methyltransferase
[Neisseria gonorrhoeae] 


6.8 


966 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


8e-04 


1345S59 


COPPER TRANSPORT 
PROTEIN CTR1 transport 
protein - yeast (Saccharomyces 
cerevisiae) gene product 
[Saccharomyces cerevisiae] 


6.7 



SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



P VALUE 



967 AF031403



968 



L29252 



969 



X16995



Homo sapiens 



MLL/AF4 
translocation 
breakpoint
t(4;11)(q21;23)



8e-04 



2498926 



Human (clone DI3^2)j 
L-iditol-2 
dehydrogenase gene, 
exon 4, exon 5, exon 
6 and exon 7. 



8e-04 



1488070 



Mouse N10 gene for
a nuclear hormonal 
binding receptor 



SMALL PROTEIN B
HOMOLOG A43259. from E. 
hirae [Mycoplasma 
pneumoniae] 



(U63997) putative transposase 
[ Enterococcus faecium ] 



8e-04 



1493833 



(U47323) stromal cell protein 
[Mus musculus] 



6.6 



5.2 



3.2 



970 M99412 



Human interleukin-8
receptor (IL8RB)
gene, complete cds 



8e-04 



1346101 



971 



U37452 



972 AF100694



973 AF093268



Human Down
Syndrome region of 
chromosome 21 
genomic sequence, 
clone A3ID6-1C5. 



4-AMINOBUTYRATE 
AMINOTRANSFERASE 
TRANSAMINASE) (GABA 
AMINOTRANSFERASE) 
homolog - smut fungus 
(Ustilago maydis) >gi|S8l562 
Emericella nidulans gamma- 
amino-n-butyrate transaminase 
Swiss-Prot Accession Number 
P14010 [Ustilago maydis]



8e-04 



4164069 



Mus musculus 
Pontin52 mRNA, 
complete cds 



8e-04 



1352877 



Rattus norvegicus
homer-1c mRNA,
complete cds 



8e-04 



1788557 



(AF111093) latrophilin 3 splice
variant bbah [Bos taurus] 



HrWIHbllLAL I3.UKJJ 
PROTEIN IN RAD26-GEF1 
INTERGENIC REGION 
>gi|1077881|pir||S57057 
probable membrane protein 
YJR038c - yeast 
(Saccharomyces cerevisiae)
>gi|1015688 (Z49538) ORF
YJR038c putative 
[Saccharomyces cerevisiae] 



(AE000312) orf, hypothetical
protein [Escherichia coli]



0.81



0.26 



0.23 



0.19 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















974 


X83872 


H. vulgaris mRNA for 
cAMP response 
element binding 
protein 


8e-04 


1175386


HYPOTHETICAL iLlKD 
PROTEIN C18B11.06 IN
CHROMOSOME I 
>gi|2130289|pir||S58305 
hypothetical protein 
SPAC18B11.06 - fission yeast
hypothetical protein 
[Schizosaccharomyces pombe] 


0.005 


975 


M32514 


Rat simple sequence 
DNA, clone 5- 


8e-04 


2394492 


(AF024502) No definition line 
found [Caenorhabditis elegans]


0.002 


976 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


8e-04 


2981631 


(AB012223) ORF2 [Canis 
familiaris] 


0.001 


977 


X89211 


H. sapiens DNA for 
endogenous retroviral 
like element 


8e-04 


2065210 


(Y12713) Pro-Pol-dUTPase 
polyprotein


3e-04 


978 


U14391 


Human myosin-IC
mRNA, complete cds. 


8e-04 


3142302 


(AC002411) Strong similarity to
myosin heavy chain gb|Z34293
from A. thaliana. [Arabidopsis
thaliana]


4e-16 


979 


L13612 


Drosophila 
melanogaster dead- 
box protein 
D.melanogaster 
DEAD-box gene, 
complete CDS 


8e-04 


3776027 


(AJ010475) RNA helicase 
[Arabidopsis thaliana] 


9e-24 


980 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


981 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


982 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


983 


Z739S7 


Human DNA 
sequence from 
cosmid N120B6 on
chromosome 22 
Contains ESTs, 
complete sequence 
[Homo sapiens]


7e-04 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 




DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Brassica rapa mRNA 










984 


AB012106


for SRK45, complete 
cds 


7e-04


<NONE>


<NONE>


<NONE>


985 


AF093268 


Rattus norvegicus 
homer- 1c mRNA, 
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


986


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


7e-04


<NONE>


<NONE> 


<NONE> 


987 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


988 


AJ005813 


Arabidopsis thaliana 
mRNA for
neoxanthin cleavage 
enzyme 


7e-04 


<NONE> 


<NONE> 


<NONE> 


989 


AF064029 


Helianthus tuberosus 
lectin 1 mRNA, 
complete cds 


7e-04 








990 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


991 


AF027173


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds 




<NONE>


<NONE> 


<NONE> 


992 


AF064029 


Helianthus tuberosus 
lectin 1 mRNA, 
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


993 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


7e-04 


<NONE> 


<NONE> 


<NONE> 


994 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


7e-04 


3327230 


(AB014608) KIAA0708 protein 
[Homo sapiens] 


9.5 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















995 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


7e-04 


3327230 


(AB014608) KIAA0708 protein
[Homo sapiens] 


9.3 


996 


AF074387 


Sambucus nigra 
hevein-like protein
mRNA, complete cds


7e-04 


3876455 


(Z93380) predicted using 
Genefinder; similar to 7tm 
receptor protein [Caenorhabditis 
elegans] 


7.1 


997 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


7e-04 


2128771 


hypothetical protein MJ1293 - 
Methanococcus jannaschii 
>gi|1591931 (U67570)M. 
jannaschii predicted coding 
region MJ1293 [Methanococcus 
jannaschii] 


6.2 


998 


U09412 


Human zinc finger 
protein ZNF134
mRNA, complete cds 


7e-04 


1083336 


glutathione transferase (EC
2.5.1.18) piA - mouse 


5.4 


999 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds 


7e-04 


473515 


(M17619) NADH
dehydrogenase subunit ND4 
[Asterina pectinifera] 


3.7 


1000 


AF012899 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


7e-04 


1724097 


(U79772) female sex protein 
[Mercurialis annua]


3.3 


1001 


AF100694


Mus musculus 
Pontin52 mRNA,
complete cds 


7e-04 


1197103 


(D49747) core, env, and part of 
E2/NS1 


3.2 


1002 


X16995


Mouse N10 gene for
a nuclear hormonal 
binding receptor 


7e-04 


345372 


unc-5 protein, long form -
Caenorhabditis elegans
>gi|258529|bbs|118648
(S47168)UNC- 
5=immunoglobulin and
thrombospondin type 1 
transmembrane protein 
{alternatively spliced} aa] 
[Caenorhabditis elegans] 
>gi|2662596 (AF036698) C. 
elegans UNC-5 (NID:g25852) 


2.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1003 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


7e-04 


4204220 


(AB022866) mobilization 
protein 


2.5 


1004


AF093268 


Rattus norvegicus
homer-1c mRNA,
complete cds 


7e-04 


3201550 


( Y 1 / 1 lo) fibrinogen-binding
protein 


2.4 


1005 


AF074386 


Sambucus nigra 
hevein-like protein 
mRNA, complete cds 


7e-04 


1174264


(U45966) polyprotein [Hepatitis 
G virus] 


0.73 


1006 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
A) mRNA, complete
cds 


7e-04 


135308 


TRANSCRIPTION FACTOR 
JUN-D 


O.CKb 


1007 


X98745 


H. sapiens EWS gene, 
intron 6, 
polymorphism 


7e-04 


728836 


!!!! ALU SUBFAMILY SP 
WARNING ENTRY 


0.001 


1008 


AJ005813 


Arabidopsis thaliana 
mRNA for 
neoxanthin cleavage 
enzyme 


7e-04 


1633564 


(U47924) C8 [Homo sapiens] 


9e-09 


1009


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds 


6e-04 


284171 


Ig epsilon chain C region form 3
- human 


1.3 




AB012106


Brassica rapa mRNA 
for SRK45, complete
cds 


6e-04 


3845262 


(AE001414) BRAHMA
ortholog (DNA helicase
superfamily II) 


0.25 


1011 


AL034404 


Human DNA 
sequence from clone 
417C12 on
chromosome Xp22.11-
22.2, complete
sequence [Homo 
sapiens] 


3e-04 


<NONE> 


<NONE> 


<NONE> 


1012 


M99701 


Homo sapiens (pp21)
mRNA, complete cds. 


3e-04 


<NONE> 


<NONE> 


<NONE> 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



P VALUE 



U00227 



Ovis aries Merino 
breed DR beta-chain 
antigen binding 
domain, MHC class I 
DRB (Ovar-DRB24) 
gene, partial cds. 



3e-04 



<NONE> 



<NONE> 



AF074387 



U95102



Sambucus nigra 
hevein-like protein
mRNA, complete cds 



3e-04 



<NONE> 



<NONE> 



Xenopus laevis 
mitotic 

phosphoprotein 90
mRNA, complete cds 



3e-04 



999418 



(L19655) ORF [Tomato
ringspot virus] 



<NONE> 



<NONE> 



8.3 



AB012106



Brassica rapa mRNA 
for SRK45, complete 
cds 



3e-04 



2367460 



(AF011415) putative 
pheromone receptor [Mus 
musculus] 



AJ010737 



Mus musculus DNA 
for microsatellite 3kb 
upstream Ibp gene 



3e-04 



4106549 



(AF104411) neuronal-specific 
septin 3 [Mus musculus] 



AF053137



Homo sapiens histone 
deacetylase 3 gene, 
exons 4, 5, 6, 7, 8, 9, 
and 10 



3e-04 



416702 



NADH-DEPENDENT FLAVIN
OXIDOREDUCTASE acid- 
inducible - Eubacterium sp 
>gi|1381570 (U57489) 
NADH:flavin oxidoreductase
[Eubacterium sp. VPI 12708]



7.0 



5.5 



5.3 



AF027173 



Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds 



cus 

Homo sapiens clone
UWGC:y23x011
from 6p21, complete
sequence [Homo
sapiens]



3e-04 



1785789 



(Y08502) orf111d [Arabidopsis
thaliana] 



5.1 



AC004173



3e-04 



558521 



(D28917) polyprotein [Hepatitis
C virus]



1.1



X57025 



Human IGF-I mRNA 
for insulin-like
growth factor I



3e-04 



4206707 



(AF118122) putative outer
membrane protein OmpU 



0.65 



X77090 



H.sapiens IL-1Ra
gene.



3e-04 



1065941 



(U40799) F42C5.7 gene product 
[Caenorhabditis elegans]



Jf'L 



0.12 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Pseudorabies virus 










1023 


M34651 


with upstream and 

downstream

sequences. 


3e-04 


2746853 


(AF040650) contains similarity 
to sodium-potassium-chloride
cotransport proteins 


7e-05 


1024 


Z36011 


S.cerevisiae 
chromosome II 
reading frame ORF 
YBR142w 


3e-04 


2500537 


PROBABLE ATP-
DEPENDENT RNA
HELICASE HAS1
>gi|626265|pir||S47451 
hypothetical protein YMR290c 
RNA helicase [Saccharomyces 
cerevisiae] 


4e-08 


1025 


AF020286 


Dictyostelium 
discoideum 2034 
gene, partial cds 


3e-04 


1465834 


(U64857) No definition line 
found [Caenorhabditis elegans] 


6e-14


1026 


L26049 


Chlamydomonas 
reinhardtii dynein 
heavy chain alpha 
(ODA11) gene, exons
2-15, and partial cds. 


3e-04 


3876775 


(Z81077) predicted using 
Genefinder; Similarity to Yeast 
protein 8248 (TR:G587531) 


9e-15 


1027 


AF020286 


Dictyostelium 
discoideum 2034 
gene, partial cds 


3e-04 


1465834 


(U64857) No definition line 
found [Caenorhabditis elegans] 


1e-17


1028 


X79811


S.cerevisiae ACT3 
gene 


3e-04 


3876090 


{Z.bVbj5) Similarity to Yeast 
uridine kinase 

(SW:URK1_YEAST); cDNA
EST EMBL:Z14695 comes
from this gene; cDNA EST
CEMSE17F comes from this
gene; cDNA EST
EMBL:D67355 comes from this
gene; cDNA EST yk209hl.5 
comes from this ge... 


7e-31


1029 


AF027173


Arabidopsis thaliana
cellulose synthase
catalytic subunit (Ath-
A) mRNA, complete
cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1030 


M22970


Human pancreatic 
phospholipase A-2
(PLA-2) gene, exons
1 to 3. 


2e-04 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human DNA 










1031 


Z68686 


sequence from 
cosmid N2E9 on 
chromosome 22. 
Contains EST, 
complete sequence 
[Homo sapiens]


2e-04 


<NONE> 


<NONE> 


<NONE> 


1032 


X95154 


H.sapiens brca2 gene 
exon 4> : : 

emb|A62779|A62779 
Sequence 20 from 
Patent WO9719110 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1033 


AJ005813


Arabidopsis thaliana
mRNA for
neoxanthin cleavage
enzyme


2e-04 


<NONE> 


<NONE> 


<NONE> 


1034 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1035 


AE001415 


Plasmodium 
falciparum 
chromosome 2, 
section 52 of 73 of 
the complete 
sequence 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1036 


AF090115 


Lycopersicon 
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA,
complete cds 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1037 


AC000958 


Homo sapiens 
(subclone 6_d9 from 
P1 H21) DNA
sequence 


2e-04 


<NONE> 


<NONE> 


<NONE> 


1038 


AF093268 


Rattus norvegicus 
homer- 1c mRNA, 
complete cds 


2e-04 


2501523 


CD59 GLYCOPROTEIN 
PRECURSOR 


7.1 


1039 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-04 


2765360 


(Y13925) cathepsin L2 [Penaeus
vannamei] 


6.S 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












RNA POLYMERASE




1040 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
B) mRNA, complete
cds 


2e-04 


133636 


>gi|67I26|pir||RRXPLC RNA- 
directed RNA polymerase (EC 
2.7.7.48) - lymphocytic 
choriomeningitis virus (strain 
Armstrong 53b) >gi|331369 


5.2 


1041 


AB012106


Brassica rapa mRNA 
for SRK45, complete
cds 


2e-04 


3822155 


(AF074613) type II secretion 
protein [Escherichia coli 
O157:H7]


4.0 


1042 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-04 


1718125 


REGULATORY PROTEIN E2 
>gi| 1020222 type 36] 


0.38 


1043 


X17058


Sus scrofa mRNA for 
glucose transporter
protein 


2e-04 


3341906 


(AB009593) xylose transporter


2e-15


1044 


AF008216 


Homo sapiens 
candidate tumor 
suppressor pp32rl 


1e-04


<NONE> 


<NONE> 


<NONE> 


1045 


X98890 


S.tuberosum mRNA 
for inorganic 
phosphate 
transporter, StPT1


1e-04


624126 


(U42530) a65L [Paramecium 
bursaria Chlorella virus 1]


7.9 


1046 


L14930


Glycine max (Rab7p) 
mRNA, complete cds.


9e-05 


<NONE> 


<NONE> 


<NONE> 


1047 


AJ009970 


Mus musculus
thromboxane A2 
receptor gene, exon 3, 
partial 


9e-05 


<NONE> 


<NONE> 


<NONE> 


1048 


Y11896


M.musculus mRNA 
for Brx gene, partial 


9e-05 


<NONE> 


<NONE> 


<NONE> 


1049 


L10832


Polistes annularis 
(clone pan48AAT) 
tandem repeat region. 


9e-05 


<NONE> 


<NONE> 


<NONE> 


1050 


AF055011 


Homo sapiens clone 
24587 mRNA 
sequence 


9e-05 


3S80586 


(Z/9/58)cUNAbM 
EMBL:D28009 comes from this 
gene; cDNA EST
EMBL:D28008 comes from this 
gene; cDNA EST 
EMBL:D32478 comes from this 
gene; cDNA EST 
EMBL:D34508 comes from this 
gene; cDNA EST 
EMBL:D37581 comes from this
gene; ... 


7.6 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1051 


U76524 


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


9e-05


3024292 


RHODOPSIN >gi|2290717 
(AF000947) rhodopsin [Sepia 
officinalis] 


6.7 


1052 


Z58294 


H.sapiens CpG DNA, 
clone 34d6, forward
read cpg34d6.ftla . 


9e-05 


3885496 


(AF064825) heparin/heparan 
sulfate N-acetylglucosaminyl N-
deacetylase/N-sulfotransferase
[Bos taurus]


0.65 


1053 


D87451


Human mRNA for 
KIAA0262 gene, 
complete cds 


9e-05 


3874739


(Z66495) similar to claustrin 
like


0.004 


1054 


L37092 


Mus musculus cyclin-
dependent kinase
homologue 


9e-05 


3080513 


(AL022598) hypothetical 
protein 


4e-09 


1055 


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds 


8e-05 


<NONE> 


<NONE> 


<NONE> 


1056 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


8e-05 


<NONE> 


<NONE> 


<NONE> 


1057 


AF074386 


Sambucus nigra 
hevein-like protein
mRNA, complete cds


8e-05 


<NONE> 


<NONE> 


<NONE> 


1058 


D10102


Homo sapiens DNA 
from cosmid
clone :844, GT repeat 
sequence 


8e-05 


<NONE> 


<NONE> 


<NONE>


1059 


U72396 


Lycopersicon 
esculentum class II 
small heat shock 
protein Le-HSP17.6 
mRNA, complete cds


8e-05 


1176475 


HYPOTHETICAL 80.4 KD
PROTEIN IN SMC3-MRPL8
INTERGENIC REGION
>gi|1078237|pir||S56849 
probable membrane protein 
YJL073w - yeast 
(Saccharomyces cerevisiae) 
>gi|895898(X88851) 
hypothetical protein YJL073w 
[Saccharomyces cerevisiae] 


6.0 


1060 


X71934 


H.sapiens XB gene 
for tenascin-X, repeat
XIII 


8e-05 


285207 


microtubule-associated protein, 
110K tau - rat >gi|207158
(M84156) big tau [Rattus 
norvegicus] 


3.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1061 


AF027174


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


8e-05 


4049682 


(AF063866) ORF MSV092
hypothetical protein 
[Melanoplus sanguinipes 
entomopoxvirus] 


2.1 


1062 


AF090115 


Lycopersicon 
esculentum cytosolic 
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


8e-05 


3861019 


(AJ235271) unknown 
[Rickettsia prowazekii] 


5e-14 


1063 


AF027174 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath- 
B) mRNA, complete 
cds 


7e-05 


<NONE> 


<NONE> 


<NONE> 


1064 


L04193 


Human lens 
membrane protein 
(mp19) gene, exon
11. 


7e-05 


<NONE> 


<NONE> 


<NONE> 


1065


X61609 


B.napus gene for 
LHC II Type III 
chlorophyll a/b 
binding protein 


7e-05 


2132314 


hypothetical protein YPR174c -
yeast similarity to a nuclear
lamin from C. elegans (PIR
accession number S42157)
[Saccharomyces cerevisiae]


S.9 


1066 


AF064029 


Helianthus tuberosus
lectin 1 mRNA,
complete cds 


7e-05 


2979422 


(AB006757) PCDH7 (BH- 
Pcdh)c [Homo sapiens] 


5.7 


1067 


AF027173 


Arabidopsis thaliana 
cellulose synthase 
catalytic subunit (Ath-
A) mRNA, complete
cds 


7e-05 


2493696 


HYPOTHETICAL 21.5 KD 
PROTEIN (ORF 185)
>gi|1480440 (U34204)
ORF185; hypothetical 21.4 kD
protein [Brassica oleracea] 


5.2 


1068 


AF093268 


Rattus norvegicus 
homer-1c mRNA,
complete cds 


7e-05 


2501029 


PROBABLE LEUCYL-TRNA
SYNTHETASE, 
MITOCHONDRIAL 
PRECURSOR (LEUCINE - 
TRNA LIGASE) (LEURS) 
KIAA0028 (Homo sapiens) 


1.4 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human DNA 










1069 


Z68758 


sequence from 
cosmid cN85E10 on 
chromosome 22q11.2-
qter 


3e-05 


<NONE> 


<NONE> 


<NONE> 


1070 


X60653 


human Histone H3.3 
pseudogene (CIR- 
456) 


3e-05 


<NONE> 


<NONE> 


<NONE> 


1071 


Z58294 


H.sapiens CpG DNA, 
clone 34d6, forward 
read cpg34d6.ftla . 


3e-05 


1706241


GUANYLYL CYCLASE GC-E 
PRECURSOR cyclase receptor 
[Mus musculus] 


9.6 


1072 


AF043251


Homo sapiens 
mitochondrial outer 
membrane protein 
(Tom40) gene, 
nuclear gene 
encoding 
mitochondrial 
protein, exons 1 
through 6 


3e-05 


U3980 


AMINE OXIDASE [FLAVIN- 
CONTAINING] B oxidase 
(flavin-containing) (EC 1.4.3.4) 
B - human B [human, platelet. 
Peptide Partial, 520 aa] [Homo 
sapiens] 


8.9 


1073 


M31104 


Chicken progesterone 
receptor gene, 
encoding forms A and 
B, exons 1 and 2.


3e-05 


1170841 


IG GAMMA LAMBDA 
CHAIN V-II REGION 


4.S 


1074 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


3e-05 


543684 


ribosomal protein S3 - 
Chlamydomonas humicola 
chloroplast (fragment) 


4.2 


1075 


L22206 


Human vasopressin 
receptor V2 gene,
complete cds. 


3e-05 


791207 


(U20615) Gnot1 homeodomain
protein [Gallus gallus] 


1.8 


1076 


AF093268 


Rattus norvegicus 
homer- 1c mRNA, 
complete cds 


3e-05 


3237340 


(AF033361) polyprotein 
[Hepatitis C virus]


0.94 


1077 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-05 


2879805 


(AL021813) hypothetical 
protein 


0.001 


1078 


AF100694


Mus musculus 
Pontin52 mRNA,
complete cds 


3e-05 


3877951 


(2S1555) predicted using 
Genefinder 


3e-07 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1079 


AF090115


Lycopersicon 
esculentum cytosolic
class II small heat 
shock protein HCT2 
(HSP17.4) mRNA, 
complete cds 


2e-05 


<NONE> 


<NONE> 


<NONE> 


1080 


AF064029 


Helianthus tuberosus 
lectin 1 mRNA, 
complete cds 


2e-05 


3880197 


(Z81132) predicted using 
Genefinder 


2.4 


1081 


AF087989 


Homo sapiens full 
length insert cDNA 
clone YX29D10 


2e-05 


113667 


!!!! ALU CLASS B WARNING 
ENTRY !!!! 


1.8


1082 


AF064029


Helianthus tuberosus 
lectin 1 mRNA, 
complete cds 


2e-05 


474896 


(L31967) mating type protein 
[Coprinus cinereus] 


1.4 


1083 


AF064029 


Helianthus tuberosus 
lectin 1 mRNA, 
complete cds 


2e-05 


2266988 


(Y13274) M33 polycomb-like
protein [Mus musculus]


0.62 


1084 


U67415 


Equus caballus UCD- 
E-CA-467 
dinucleotide repeat 
region, complete 
sequence 


1e-05


<NONE> 


<NONE> 


<NONE> 


1085 


X67277 


H. sapiens BGP gene 
for biliary 
glycoprotein, 
promoter region and 
exon 1 


1e-05


<NONE> 


<NONE> 


<NONE> 


1086 


X85117


H. sapiens epb72 gene 
exons 2,3,4,5,6,7


1e-05


<NONE> 


<NONE> 


<NONE> 


1087 


U88328 


Mus musculus 
suppressor of 
cytokine signalling-3


1e-05


443877 


(Z29457) core region; 
pid:g443877 [Hepatitis C virus]


3.9 


1088 


Y12853 


Homo sapiens P2X7 
gene, exon 4-8 


1e-05


3878726 


]Z664^8) similar to cuticle 
collagen; cDNA EST 
EMBL:D75584 comes from this
gene 


0.36 


1089 


AE001140


Borrelia burgdorferi 
(section 26 of 70) of 
the complete genome 


1e-05


33607 19 


(AJ235270) GLUTAMYL-
tRNA AMIDOTRANSFERASE
SUBUNIT A (gatA) [Rickettsia 
prowazekii] 


4e-15
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1090 


AJ224M2 


Homo sapiens gamma 
adaptin gene, exon 2 
and flanking intronic 
sequences 


9e-06 


<NONE> 


<NONE> 


<NONE> 


1091 


AB000565 


Homo sapiens DNA 
for repeat sequence 
Alu . 


9e-06 


72879 


translation initiation factor IF-2 - 
Escherichia coli 


5.1 


1092 


Z78985 


H. sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA20B4 


9e-06 


159975


(M65164) 51C surface protein 
[Paramecium tetraurelia] 


4.8 


1093 


Z21677 


Thermotoga maritima
DNA for spc operon 


9e-06 


585879 


50S RIBOSOMAL PROTEIN
L2 maritima >gi|437926
(Z21677) ribosomal protein L2 


7e-14


1094 


AF031494 


Drosophila hydei 
Dhc7 (Threads) 
mRNA, complete cds


9e-06 


729377 


DYNEIN BETA CHAIN, 
CILIARY sea urchin 
(Anthocidaris crassispina) chain 
[Anthocidaris crassispina] 


4e-18


1095 


AF051315 


Homo sapiens 
placental protein 
17al (PP17) mRNA, 
complete cds 


4e-06 


<NONE> 


<NONE> 


<NONE> 


1096 


AC001460


Homo sapiens 
(subclone 2_f4 from 
BAC H107) DNA
sequence 


4e-06 


2648304 


(AE000952) ISA1214-6,
putative transposase 


6.2 


1097 


X85030 


H.sapiens mRNA for 
skeletal muscle- 
specific calpain 


4e-06 


4239857 


(AB016726) calpain
[Schistosoma japonicum] 


0.006 


1098 


M75162 


Human polymorphic 
arylamine N-
acetyltransferase


3e-06 


<NONE> 


<NONE> 


<NONE> 


1099 


AB009999 


Rattus norvegicus 
mRNA for CDP-
diacylglycerol
synthase, complete 
cds 


3e-06 


3879045 


(Z70309) R102.6
[Caenorhabditis elegans] 


7.3 


1100 


Z78985


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA20B4 


3e-06 


266529 


MERCURIC REDUCTASE 
(HG(II) REDUCTASE) 
>gi|418744|pir[|S30l6S 
mercury (II) reductase 


6.5 


1101


AB012190 


Homo sapiens mRNA 
for Nedd8-activating
enzyme hUba3, 
complete cds 


3e-06 


3877938 


(Z79697)F58H10J 
[Caenorhabditis elegans]


6.3 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank)



ACCESSION DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



Homo sapiens



1102 AF041056 



WSCR4 gene, exons 
3 and 4 



1103 X00777 



Mouse E(d) beta gene 
5' flanking region and
exon I 



DESCRIPTION 



P VALUE 



3e-06 



1568583 



3e-06 



1680722 



(Z80775) hypothetical protein 
Rv0044c 



(U72497) fatty acid amide 
hydrolase [Rattus norvegicus] 



1.9



0.008 



1104 D21205 



Human mRNA for 
estrogen responsive 
finger protein, 
complete cds



3e-06 



563127 



1105 Z47046



Human cosmid 
QLL2C9 from Xq28 



(U09825) acid finger protein 
[Homo sapiens]



1e-05



1e-06



<NONE> 



<NONE> 



<NONE> 



1106 L26261 



Human MHC class II
HLA-RP1 gene.



1e-06



<NONE> 



<NONE> 



<NONE> 



1107 M13402 



Rat 5S RNA gene, 
clone 5S-2. 



1e-06



<NONE> 



<NONE> 



<NONE> 



1108 X68793 



1109 



H. sapiens gene for 
antithrombin III 



1e-06



<NONE> 



<NONE> 



<NONE> 



Homo sapiens 
Krueppel family zinc 
AF003540 finger protein 



1e-06



2507553 



ZINC FINGER PROTEIN 33A 
(ZINC FINGER PROTEIN 
KOX31)(KIAA0065) 
HA0946) Kruppel-related. 
[Homo sapiens]



0.098 



1110 L42096



Homo sapiens 
(subclone 10_d2 from 
P1 H21) DNA
sequence. 



1e-06



1330401 



(U58762)T27F7.1 gene product 
[Caenorhabditis elegans]



0.015 



1111 Z69925 



Human DNA
sequence from 
cosmid cNI 16A5, 
between markers 
D22S280 and 
D22S86 on 
chromosome 22q12
contains EST 



1112 D90217



S. cerevisiae gene for 
YmL33, 
mitochondrial
ribosomal proteins of
large subunit



9e-07 



<NONE> 



9e-07 



3879097 



<NONE> 



(Z81109) predicted using
Genefinder; similar to 
sodium/phosphate transporter; 
cDNA EST yk326f6.3 comes 
from this gene; cDNA EST 
yk326f6.5 comes from this gene 
[Caenorhabditis elegans]



<NONE> 



7.1 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



P VALUE 



elegans cDNA ykJ4bl.:>; coded 
for by C. elegans cDNA
yk13h10.5; coded for by C.
elegans cDNA yk46e8.5; coded
for by C. elegans cDNA
yk46d5.5; coded for by C.
elegans cDNA yk43c2.5; coded
for by C. elegans cDNA
yk46e8.... 



168 Jbbl coded tor by L. 



1113 



AF012899 



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 



9e-07 



1330345 



1114 



AF086562 



Homo sapiens full 
length insert cDNA 
clone ZE16C03 



4e-07 



1072210 



(U40945) coded for by C.
elegans cDNA yk74b9.3; coded 
for by C. elegans cDNA 
yk74b9.5; similar to repeat of 
calcium channel alpha subunits; 
similar to tetracycline resistance 
protein; similar to hypothetical 
protein in HSP30-PMP1 region 
(SP„ 



1115 



L39062 



Homo sapiens 
interleukin 9 receptor
IL9R pseudogene, 
exons 1-9 



4e-07 



3879983 



(Z4b/yD) similar to 
transforming protein ect2;
cDNA EST EMBL:D34137
comes from this gene; cDNA 
EST EMBL:D37172 comes 
from this gene; cDNA EST 
EMBL:D76266 comes from this 
gene; cDNA EST 
EMBL:D70493 comes from this 
gene; cDNA ...



2e-29 



3.9 



3.3 



1116 



Z69364



Human DNA 
sequence from 
cosmid L96F8,
Huntington's Disease
Region, chromosome
4p16.3 contains EST
and cDNA. >
emb|Z69365|HSL96F8A Human DNA
sequence from
cosmid L96F8,
Huntington's Disease
Region, chromosome
4p16.3 contains EST
and cDNA. 



4e-07 



3493176 



(AF022889) latent TGF beta
binding protein [Mus musculus]



Hbi 



3.0 
SEQ
ID



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



1117 D79986 



1118 D43950



1119 AF037168



1120 X69838



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)
ACCESSION 



DESCRIPTION 



Human mRNA for



KIAA0164 gene,
complete cds 



4e-07 



4038031



Human mRNA for 
KIAA0098 gene, 
partial cds 



3e-07 



<NONE> 



Arabidopsis thaliana 
DnaJ homologue 
(AU6) mRNA, 
complete cds 



3e-07 



H.sapiens mRNA for 
G9a | 3e-07



1121 AB011124



1122 K03012



1123 AB016195



Homo sapiens mRNA
for KIAA0552

protein, complete cds | 2e-07
Human cellular fms
proto-oncogene,

partial cds. | 1e-07



1124 Y16795



1125 AB012624



1126 AJ131341



1127 L81902



1128 Y17968



1129 Y13901



Homo sapiens DNA, 
microsatellite and Alul 
repeat region 



1e-07



Homo sapiens 
psihHaA pseudogene 
Homo sapiens FLI1
gene for ERGB
transcription factor,
intron 4 and partial 
cds 



4e-08 



Homo sapiens ogg1
gene, exons 1-7



Homo sapiens 
(subclone 1_c10 from
P1 H69) DNA
sequence 



Gallus gallus mRNA
for high mobility
group 1 protein 



Homo sapiens FGFR- 



4e-08 



4e-08



3e-08 



3e-08



4 gene 



3881075 



P VALUE 



(AC005936) hypothetical 
protein [Arabidopsis thaliana]



<NONE> 



3873414 



2618749 



<NONE> 



lALu^co/) predicted using 
Genefinder, similar to DnaJ 
domain ; Thioredoxin; cDNA 
EST yk433f3.5 comes from this 
gene; cDNA EST 
EMBL:D32359 comes from this 
gene; cDNA EST 
EMBL:D34721 comes from this 
gene; cDNA EST yk433f3.3 c. 



(U00043) similar to D. 
melanogaster trithorax protein 



(U90880) hypothetical protein 
2; predicted using XGrail 



728837 



<NONE> 



728836 



113668 



4225950 



<NONE> 



!!!! ALU SUBFAMILY SQ
WARNING ENTRY 



0.30 



<NONE> 



3e-09 



3e-29 



2.0 



<NONE> 



<NONE> 



!!!! ALU SUBFAMILY SP
WARNING ENTRY 



0.095 



!!!! ALU CLASS C WARNING
ENTRY !!!!



(AJ132701) centaurin gamma 1B



3.6 



3e-05 



1.8 



3041855 



1e-08



<NONE> 



(AC004537) similar to tumor 
suppressor p33ING1; similar to
AF044076 (PID:g2829208)
[Homo sapiens] 



<NONE> 



3e-31 



<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1130 


I 29024 


Mesocricetus auratus
serum amyloid P 
component gene, 

r-r\rn r\lp|p pric 


1e-08


<NONE>




<NONE>


1131 


AF012899 


Sambucus nigra 
ribosome inactivating 

protein precursor

mRNA, complete cds


1e-08


<NONE> 


<NONE> 


<NONE> 


1132 


X14034


Human mRNA for 
phospholipase C >
gb|M37238|HUMPLC Human
phospholipase C 
mRNA, complete cds. 


1e-08


<NONE> 


<NONE> 


<NONE> 


1133 


Z59381 


H.sapiens CpG DNA,
clone 152b10,
forward read
cpg152b10.ft1a.


1e-08


<NONE> 


<NONE> 


<NONE> 


1134 


L81839 


Homo sapiens 
(subclone 2_h3 from 
P1 H43) DNA
sequence 


1e-08


<NONE> 


<NONE> 


<NONE> 


1135 


X14448 


Human GLA gene for

alpha-D-galactosidase 
A (EC 3 1 


1e-08


3334427 


HYPOTHETICAL PROTEIN 
MJ1207 Methanococcus 
jannaschii >gi| 1591837 
(U67562) protease synthase and 
sporuiaiion negative regulator 
?ail, putative [Methanococcus 

nnnntphi 1 1 

fclllllUOl*IJ 111 


Q 1 


1136 


AL023774 


Human DNA 
sequence from clone 
799F15 on
chromosome Xq25, 
complete sequence 
[Homo sapiens]


1e-08


1354935 


(U58330) probable copper-
transporting atpase 


1.2 


1137 


X64639 


H.sapiens DNA
repetitive 
subtelomeric-like 
sequence (522 bp) 


1e-08


77356 


hypothetical 70K protein - 
eggplant mosaic virus 


0.098 


* 

1138 


U97058 


Human HuD gene,
5'UTR 


5e-09 


3387886


(AF070530) unknown [Homo 
sapiens] 


9.5 







WO 01/02568 PCT/US00/18374








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human DNA










1139 


Z82181


sequence from 
cosmid E86D10 on
chromosome 22. 
contains ESTs, 
exontrap, complete 
sequence 


5e-09 


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


8.4 


1140 


AJ006587 


Mus musculus mRNA 
for translation 
initiation factor eEF2 
gamma X 


5e-09 


1872200 


(U22376) alternatively spliced 
product using exon 13A 


0.64 


1141 


Y11108


H.sapiens WNT8B 
gene 


4e-09 


2854198 


(AF045646) contains similarity 
to collagens 


4.0 


1142 


AE001223 


Treponema pallidum 
section 39 of 87 of 
the complete genome 


4e-09 


3334189 


CELL DIVISION PROTEIN 
FTSY HOMOLOG 


1.5 


1143 


Z47046 


Human cosmid 
QLL2C9 from Xq28 


4e-09 


104045 


fibroblast growth factor receptor 
Al precursor - African clawed 
frog >gi|214894 (M55163)
fibroblast growth factor receptor 
[Xenopus laevis] 


1.3 


1144 


AG000746 


Homo sapiens 
genomic DNA, 21q 
region, clone: 
Tl71Bm40 


4e-09 


113666 


!!!! ALU CLASS A WARNING 
ENTRY !!!!


0.33 


1145 


M74002 


Human arginine-rich
nuclear protein
mRNA, complete cds.


4e-09 


3875371


arginine rich domain, possesses 
weak similarity with the RNA 
binding domains from RNA 
splicing factor U2AF 65 KD
subunit; cDNA EST 
EMBL:D64658 comes from this 
gene; cDNA EST 

L. I VI U . VJ\J\)(J 1- J 1-.. 

>gi|3878699|gnl|PID|e1351700
possesses weak similarity with 
the RNA binding domains from 
RNA splicing factor U2AF 65 
KD subunit; cDNA EST
EMBL:D64658 comes from this 
gene; cDNA EST 
EMBL:D66529 comes f...


3e-06 


1146 


U95094 


Xenopus laevis XL-
INCENP (XL-
INCENP) mRNA,
complete cds 


2e-09 


2494337 


ENDO-1,4-BETA-XYLANASE
PRECURSOR sp ] 


4.9 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 













UDP- 




1147 


U20554 


Drosophila 
melanogaster UDP- 
glucose:glycoprotein 
glucosyl transferase 
mRNA, complete cds.


2e-09 


2499087 


GLUCOSE:GLYCOPROTEIN

GLUCOSYLTRANSFERASE 
PRECURSOR (DUGT) 
glucosyltransferase - fruit fly 
(Drosophila sp.) 
glucosyltransferase precursor 
[Drosophila melanogaster] 


4e-24 


1148 


Z56162 


H.sapiens CpG DNA, 
clone 91c9, forward 
read cpg91c9.ft1a.


1e-09


<NONE>


<NONE> 


<NONE> 


1149 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-09


1002424 


(U25739) YSPL-1 form 1 [Mus 
musculus] 


8.9 


1150 


M85276 


Homo sapiens NKG5 
gene, complete cds. 


1e-09


2315436 


(AF016447) No definition line 
found [Caenorhabditis elegans] 


8.3 


1151 


M94065 


Human 

dihydroorotate 
dehydrogenase 
mRNA, 3' end.


1e-09


3892656 


(AB014464) MGC-24v [Mus 
musculus] 


6.2 


1152 


AJ131895 


Homo sapiens 
genomic CAG repeat 
element, clone 
60o2(250) 


5e-10 


<NONE> 


<NONE> 


<NONE> 


1153 


Z82181 


Human DNA
sequence from
cosmid E86D10 on
chromosome 22. 
contains ESTs, 
exontrap, complete 
sequence 


5e-10 


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


7.9 


1154 


AJ224442 


Homo sapiens mRNA 
for putative 
methyltransferase


5e-10 


113667 


!!!! ALU CLASS B WARNING 
ENTRY !!!!


0.15 


1155 


AJ010230 


Homo sapiens RET 
finger protein-like 1
antisense transcript,
partial 


5e-10 


728834


!!!! ALU SUBFAMILY SB2 
WARNING ENTRY 


0.006 


1156 


AF111116 


Homo sapiens 
silencer of death 
domains (SODD) 
mRNA, complete cds


5e-10 


4160014 


(AF111116) silencer of death
domains [Homo sapiens] 


2e-03 


1157 


Z97017 


Homo sapiens mRNA 
for hypothetical 
protein


4e-10 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)



ACCESSION 



DESCRIPTION 



1158 AF001298



1159 Y11395



1160 



1161 



1162 



P VALUE 



Homo sapiens type II 
integral membrane 
protein 



H.sapiens mRNA for 

0 

Human non-coding 
sequence upstream
from DOC-2 gene on
U41096 chromosome 5



4e-10



2e-10 



Sambucus nigra
ribosome inactivating
protein precursor
AF012899 mRNA, complete cds



S.cerevisiae
chromosome II
reading frame ORF
Z36111 YBR242w



2e-10



6e-11



6e-11



1163 D89174



Schizosaccharomyces 
pombe mRNA, partial
cds, clone: SY 1004 



1164 Z95437



Human DNA 
sequence from 
cosmid A 1 on 
chromosome 6
contains ESTs. 
HERV like retroviral 
sequence 



6e-11



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



5e-11



1165 AF012899



1166 X56997



1167 AF086253



Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 



5e-11



Human UbA52 gene 
coding for ubiquitin- 
52 amino acid fusion 



Homo sapiens full 
length insert cDNA 
clone ZD40G1: 



2e-11



2e-11



DESCRIPTION 



P VALUE 



<NONE> 



1000340 



728837 



<NONE> 



2213560 



3879758 



<NONE> 



3886065 



<NONE> 



(U34384) CheW [Borrelia 
burgdorferi]



!!!! ALU SUBFAMILY SQ 
WARNING ENTRY



<NONE> 



<NONE> 



2.4 



0.28 



<NONE> 



l?u. 7 . 05 :.! ^ Ypo thctical Proton | 3e-27 
\[£ovZZi)) birmlanty to yeast 

protein (TREMBL ID E246895);

cDNA EST EMBL:T00018 

comes from this gene; cDNA 

EST EMBL:C13908 comes

from this gene; cDNA EST 

EMBL:C11656 comes from this

gene; cDNA EST yk234a5.3 

comes from this ge... | 4e-30 



<NONE> 



2134780



<NONE> 



<NONE> 



(AF106581) contains similarity 
to C4-type zinc fingers



<NONE> 



apoptosis inhibitor IAP homolog 
- human 



4.9 



<NONE> 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1168 


AB018314 


Homo sapiens mRNA 
for KIAA0771
protein, partial cds 


2e-11


3024343 


P53-BINDING PROTEIN
53BP2 Bbp/53BP2 [Homo 
sapiens] 


2e-11


1169 


Z74972 


S.cerevisiae 
chromosome XV 
reading frame ORF 
YOR064c 


2e-11


3041855 


(AC004537) similar to tumor 
suppressor p33ING1; similar to
AF044076 (PID:g2829208)
[Homo sapiens] 


2e-40 


1170 


Z82181 


Human DNA 
sequence from 
cosmid E86D10 on
chromosome 22. 
contains ESTs, 
exontrap, complete 
sequence 


7e-12



<NONE> 


<NONE> 


<NONE> 


1171 


X77738 


H. sapiens red cell 
anion exchanger 
(EPB3, AE1, Band 3) 
gene, 3' region


7e-12


2135416 


hypothetical protein - human 
>gi|288145 


0.012 


1172 


S61977 


medium-chain acyl- 
CoA dehydrogenase 
{exon 10, intron 10} 
[human, Genomic, 
1407 nt] 


6e-12 


113666 


!!!! ALU CLASS A WARNING
ENTRY !!!! 


0.100 


1173 


X66285 


M.musculus DNA for 
HC1 locus 


6e-12


854065 


(X83413) U88 [Human 
herpesvirus 6] 


2e-06 


1174 


S78744 


protein S=activated 
protein C cofactor 
[rats, liver, mRNA,
3315 nt] 


6e-12 


2338292 


(AF009243) proline-rich Gla 
protein 2 [Homo sapiens] 


3e-10


1175 


X58474 


Bovine OXT gene for 
oxytocin, 5' 
noncoding region


2e-12 


1296429 


(L77967) small proline-rich 
protein with paired repeat 


4.1 


1176 


Z56314


H. sapiens CpG DNA, 
clone 10h10, reverse
read cpg10h10.rt1a.


2e-12


2935221


(AF030154) pVII [bovine 
adenovirus type 3]


2.8 


1177 


Z56314


H.sapiens CpG DNA, 
clone 10h10, reverse
read cpg10h10.rt1a.


2e-12


2708659 


(AF037440) putative 26 kDa 
protein [Edwardsiella ictaluri]


2.8 


1178


Z19543


M.musculus h2- 
calponin cDNA 


2e-12 


2497945


BETA SCRUIN >gi|1015535
(Z47541) beta scruin [Limulus 
polyphemus] 


2e-04



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






erythropoietin 










1179 


S45332 


receptor [human, 
placental, Genomic,
8647 nt]


7e-13


728835 


!!!! ALU SUBFAMILY SC
WARNING ENTRY 


0.074 


1180 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


2e-13 


<NONE> 


<NONE> 


<NONE> 


1181 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-13


<NONE> 


<NONE> 


<NONE> 


1182 
* 


Z59509 


H.sapiens CpG DNA,
clone 15a1, reverse
read cpg15a1.rt1a.


2e-13 


3150251 


(AL023634) hypothetical 
protein 


0.66 


1183 


D10170


Human CYP11B2 
gene for steroid 18- 
hydroxylase 


2e-13 


728837 


!!!! ALU SUBFAMILY SQ 
WARNING ENTRY 


3e-05 


1184 


U65416 


Human MHC class I 
molecule (MICB) 
gene, complete cds 


2e-13 


126295 


LINE-1 REVERSE
TRANSCRIPTASE 
HOMOLOG 


6e-11


1185 


AJ006031


Mus musculus 
IHABP gene,
promoter 


8e-14


2132223 


hypothetical protein YPL186c - 
yeast 


1.1 


1186 


U34976 


Human gamma- 
sarcoglycan mRNA, 
complete cds 


8e-14 


1054903 


(U34976) gamma-sarcoglycan 
[Homo sapiens] >gi|4239660
sapiens] 


0.034 


1187 


D30647 


Rat mRNA for very-
long-chain Acyl-CoA 
dehydrogenase, 
complete cds 


8e-14 


3183512 


ACYL-COA 

DEHYDROGENASE, VERY- 
LONG-CHAIN SPECIFIC
(VLCAD) >gi|2388724
(AF017176) very-long-chain
acyl-CoA dehydrogenase [Mus 
musculus] 


8e-23 


1188 


Z63247 


H.sapiens CpG DNA,
clone 7g4, forward
read cpg7g4.ft1a.


6e-14 


86285 


histone Hl.01 - chicken 


6.8 


1189 


U27196 


Gallus gallus zinc 
finger protein (Fzf-1)
mRNA, complete cds.


3e-14 


2134436 


zinc finger protein - chicken 
(fragment)


4e-10 


1190 


M26219


African green 
monkey origin of 
replication 


2e-14 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Mus musculus 










1191 


AF100694


Pontin52 mRNA,
complete cds 


2e-14 


4235641 


(AF119040) NL0D
[Lycopersicon esculentum]


0.65 


1192 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-14 


3043728 


(AB011174) KIAA0602 protein
[Homo sapiens] 


0.28 


1193 


AJ005866 


Homo sapiens mRNA
for putative Sqv-7- 
like protein, partial 


2e-14 


4008517


(AJ005866) Sqv-7-like protein 
[Homo sapiens] 


0.004 


1194 


U32709 


Haemophilus 
influenzae Rd section 
24 of 163 of the 
complete genome 


2e-14 


3861056 


(AJ235272) 

POLYRIBONUCLEOTIDE 
NUCLEOTIDYLTRANSFERA 
SE (pnp) [Rickettsia 

prowazekii] 


6e-28 


1195 


AF073485 


Homo sapiens MHC 
class I-related protein 
MR1 precursor 
(MR1) gene, partial 

cds 


8e-15 


728831 


!!!! ALU SUBFAMILY J
WARNING ENTRY 


1.0 


1196 


AF052135 


Homo sapiens clone 
23625 mRNA 
sequence 


8e-15


4098124 


(U73522) AMSH [Homo 
sapiens] 


8e-14 


1197 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-15 


<NONE> 


<NONE> 


<NONE> 


1198 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds


3e-15 


113671 


!!!! ALU CLASS F WARNING 
ENTRY !!!! 


1.7 


1199 


Z75104 


S.cerevisiae 
chromosome XV 
reading frame ORF 
YOR196c 


3e-15 


3878570 


(Z46381) similar to lipoic acid 
synthase; cDNA EST yk283b6.3 
comes from this gene; cDNA 
EST yk283b6.5 comes from this 
gene; cDNA EST yk472f5.3 
comes from this gene; cDNA
EST yk472f5.5 comes from this
gene; cDNA EST yk476e7.3...


1e-15
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(U428J3) coded for by C.




1200 


X70052 


S.cerevisiae sof1
gene 


3e-15 


1125754


elegans cDNA cml6t6; coded
for by C. elegans cDNA
CEESU63F; similar to S.
cerevisiae SOF1 protein
(SP:P33750) [Caenorhabditis 
elegans] 


3e-29 


1201 


AF012899


Sambucus nigra 
ribosome inactivating 
protein precursor 
mRNA, complete cds 


2e-15 


<NONE>


<NONE> 


<NONE> 


1202 


M92295 


Gorilla gorilla gamma 
1 and gamma-2 
globin genes, 
complete cds. 


1e-15


284078 


hypothetical protein 2 - human 
>gi|182220


7.4 


1203 


L34587 


Homo sapiens RNA 
polymerase II
elongation factor SIII,
p15 subunit mRNA,
complete cds. >
gb|AR022286|AR022
286 Sequence 7 from 
patent US 5792634 


9e-16


<NONE> 


<NONE> 


<NONE> 


1204 


D83649 


Xenopus laevis 
mRNA for xSox7
protein, complete cds 


8e-16 


2447043 


(D83649) xSox7 protein 
[Xenopus laevis]


4e-06 


1205 


AC005190 


Homo sapiens PAC 
clone DJI152D16 
from Xq23, complete 
sequence [Homo 
sapiens] 


3e-16 


<NONE> 


<NONE> 


<NONE> 


1206 


J03626 


Human UMP 
synthase mRNA, 
complete cds. 


3e-16 


113667 


!!!! ALU CLASS B WARNING 
ENTRY !!!!


0.65 


1207 


J00083 


Human Alu family 
interspersed repeat; 
clone BLURiL 


3e-16 


728836 


!!!! ALU SUBFAMILY SP
WARNING ENTRY 


4e-06 


1208 


U70674 


Mus musculus m-
Numb (m-nb) mRNA,
complete cds 


1e-16


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1209 


U66619 


Human SWI/SNF
complex 60 KDa 
subunit (BAF60c) 
mRNA, complete cds 


1e-16


1549247 


(U66619) SWI/SNF complex 60
KDa subunit [Homo sapiens]


0.003 


1210 


U75467 


Drosophila 

melanogaster Rga and
Atu genes, complete 
cds 


1e-16


1658503 


(U75467) Atu [Drosophila 
melanogaster] 


5e-32 


1211 


M72709 


Human alternative 
splicing factor 
mRNA, complete cds.


3e-17


<NONE> 


<NONE> 


<NONE> 


1212 


IF26556 


Human ferritin H 

(FTHL13)

pseudogene.


3e-17


<NONE> 


<NONE> 


<NONE> 


1213 


D32064 


Human gene for 2- 
oxoglutarate 
dehydrogenase, 
complete cds 


3e-17 


2088843 


(AF003386) F59E12.9 gene 
product [Caenorhabditis 
elegans]


0.12 


1214 


M76364 


Human (Papua New 
Guinean) 

Mitochondrial DNA 
control region, 
sequence 131. 


3e-17


114009 


APAG PROTEIN 
>gi|72927|pir||BVECAG apaG
protein - Escherichia coli
>gi|40918 (X04711) URF
hypothetical protein 
[Escherichia coli] 


0.006 


1215 


AF017466 


Homo sapiens 
genomic sequence 
from subtelomeric
region of 
chromosome 4q 


1e-17


3947985 


(U78948) MADS-box protein 2
[Malus domestica]


4.1 


1216 


AF004876 


Homo sapiens 
54TMp (54tm) 
mRNA, complete cds


1e-17


4101574 


(AF004876) 54TMp [Homo 
sapiens] 


0.006 


1217 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-18 


<NONE> 


<NONE> 


<NONE> 


1218 


AF086758 


Rattus norvegicus Na- 
K-2C1 cotransporter 


4e-18 


3892703 


(AL033545) putative glycine- 
rich protein [Arabidopsis 
thaliana] 


0.30 


1219 


AF020089 


Homo sapiens 
PEN11B mRNA,
complete cds 


4e-18 


2642493 


(AF023910) DNA
topoisomerase I [Physarum
polycephalum]


0.083


1220 


X82333 


H.sapiens IRLB gene 
(exon 1-3)


4e-18 


106837 


irlB protein - human (fragment)
>gi|33969


2e-11
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human mRNA for










1221 


AB002383 


KIAA0385 gene,
complete cds 


4e-18


3228540 


(AF060181) zinc finger protein 
[Homo sapiens] 


6e-25


1222 


X98485


P.vivax PV14 gene 


1e-18


<NONE> 


<NONE> 


<NONE>


1223 


Z79057


H.sapiens flow-sorted 
chromosome 6 
HindIII fragment,
SC6pA21E8


1e-18


2981631


(AB012223) ORF2 [Canis
familiaris]


0.001 




L01457 


Homo sapiens (clone 
JH4BI)PM-scl 
autoantigen mRNA, 
complete cds. 


1e-18


346287 


nucleolar 100K polymyositis- 
scleroderma protein - human 
>gi|35555 (X66113) PM/Scl
100kD nucleolar protein [Homo
sapiens] 


0.001 


1225


L02897 


Dog nonerythroid 
beta-spectrin mRNA,
3' end. 


4e-19 


3493358 


(AB017037) nonstructural 
protein precursor [Himetobi P 
virus] 


0.12 


1226 


AB012162


Homo sapiens mRNA 
for APCL protein, 
complete cds 


4e-19


3894265 


(AB012162) APCL protein
[Homo sapiens] 


0.002 


1227 


AB011093


Homo sapiens mRNA 
for KIAA0521 
protein, partial cds 


4e-19 


3043566 


(AB011093) KIAA0521 protein
[Homo sapiens] 


9e-09 


1228 


X78454


X.laevis AB21
mRNA for RPD3
homologue


4e-19 


3023945 


HISTONE DEACETYLASE
(HD) thaliana]


5e-34 


1229 


U88895


Human endogenous 
retrovirus H D1
leader

region/integrase-
derived ORF1,
ORF2, and putative 
envelope protein 
mRNA, complete cds 


2e-19


59977 


(Z14310) tripartite fusion
transcript PLA2L [Human 
endogenous retrovirus] 


1e-114


1230 


U34377 


Human tyrosine 

kinase TXK (txk) 

gene, exon 13.

* 


1e-19


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


3e-05 


1231 


X72966 


M.musculus rab3A
gene 


1e-19


2408076 


(Z99167) putative peroxisomal 
organisation and biogenesis 
protein [Schizosaccharomyces
pombe] 


2e-09 


1232 


AB007953 


Homo sapiens 
mRNA, chromosome 
1 specific transcript 
KIAA0484


4e-20 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












(AB001535) similar to




1233 


D14034


Human gene for Zn-
alpha2-glycoprotein,
complete cds 


2e-20 


3928756 


C.elegans hypothetical protein 

PFTfiiHR I PPrnsPO t rpp<; 
no. i ,\_cv_vj-H_. i - j,Ltrj 

4D1.5. similar to trp and trp-like 

proteins [Homo sapiens] 


1e-07


1234 


X82126 


H.sapiens HOK-2 
gene, exon 2 


2e-20 


2137269 


DNA-binding protein - mouse 
>gi|437444


1e-19


1235 


AF093684 


Luciferase reporter 
vector pXP2 *SA,, 
complete sequence 


5e-21 


2773363 


(AF041382) microtubule 
binding protein D-CLIP-190


5.5 


1236 


J05272 


Human IMP 
dehydrogenase type 1 
mRNA, complete cds.


5e-21 


124417 


INOSINE-5'-

DEHYDROGENASE 1 (IMP
DEHYDROGENASE 1) 
(IMPDH-I) (IMPD I) - human


2e-04 


1237 


D86997 


Human (lambda) 
DNA for 

immunoglobulin light

chain 


5e-21 


3878261 


(Z75712) Similarity to S. Pombe 
BEM1/BUD5 suppressor; 
cDNA EST EMBL:Z14470
comes from this gene; cDNA 
EST yk482d4.3 comes from this 
gene; cDNA EST yk482d4.5 

[Caenorhabditis elegans] 


6e-46 


1238 


Z79865


H.sapiens 

chromosome 22 CpG 
island DNA genomic 
MseI fragment, clone
302f3, forward read 
302f3.f 


2e-21 


2739037 


(AF024614) ADAM 10 
[Caenorhabditis elegans] Zinc-
binding metalloprotease domain;
cDNA EST CEMSA42F comes 
from this gene; cDNA EST 
yk2 1 St3-3 comes from this gene; 
cDNA EST yk443d9.3 comes
from this gene; cDNA EST 
yk443d9.5 comes from this 
gene; cDNA...


2.6 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












xmjvrjQjtiJ) simiiJi uj ijuittttti — 




1239 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


6e-22 


3924779 


B, iDNA E3T )UjOUQ.j luim 
from this gene; cDNA EST 
yk249a6.5 comes from this 

gene; cDNA EST yk219a2.5
comes from this gene; cDNA
EST yk355e4.5 comes from this
gene; cDNA EST yk224f4.5 
comes fr... 

>gi|3924881|gnl|PID|e1354569
from this gene; cDNA EST 
yk249a6.5 comes from this
gene; cDNA EST yk219a2.5 
comes from this gene; cDNA 
EST yk355e4.5 comes from this 
gene; cDNA EST yk224f4.5
comes from... 


0.35 


1240 


U67824 


Human primary Alu 
transcript 


6e-22 


728832 


!!!! ALU SUBFAMILY SB 
WARNING ENTRY 


5e-07 


1241 


AF070636 


Homo sapiens clone 
24686 mRNA 
sequence 


2e-22 


98710 


fatty-acid synthase (EC
2.3.1.85) - Brevibacterium
ammoniagenes


2.5 


1242 


D14034


Human gene for Zn- 
alpha2-glycoprotein,
complete cds 


2e-22 


4185939 


(Y17832) pol protein [Human
endogenous retrovirus K] 


0.29 


1243 


M61835 


Human lactase 
phlorizin hydrolase 
(LCT) gene, exon 2.


2e-22 


728831


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


0.006 


1244 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


6e-23 


1350828 


RABPHILIN-3A 
>gi|477100|pir||A48097
rabphilin-3A - bovine
>gi|285646|gnl|PID|d1003285


0.14 


1245 


AF074985 


Homo sapiens full 
length insert cDNA
YH73H06 


8e-24 


3170548 


(AF056116) unknown [Fugu
rubripes]


0.24 


1246 


D14878


Human mRNA for 
protein D123, 
complete cds 


7e-24 


<NONE> 


<NONE> 


<NONE> 


1247


D16917


Human HepG2 3'
region cDNA, clone
hmd?d07 


6e-24 


1397345 


(U61955) contains multiple 
region of strong similarity to 
C2H2-type zinc fingers
(PS:PS00028) [Caenorhabditis
elegans]


2.4 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Human DNA 










1248 


Z69654


sequence from 
cosmid L98A6,
Huntington's Disease 
Region, chromosome 
4p16.3


3e-24 


4240566 


(AF123462) neurexin III [Homo
sapiens] 


4.5 


1249 


AB007914 


Homo sapiens mRNA 
for KIAA0445 
protein, complete cds 


2e-24 


3885949 


(AF095568) amelogenin 
[Paleosuchus palpebrosus]


3.2 


1250 


AF088072


Homo sapiens full 
length insert cDNA 
clone ZD93D10 


2e-24 


323091 


immunodominant microneme
protein EtplOO - Eimeria tenella 
>gi|2707733 (AF032905) 
microneme protein precursor 
Etmic-1 [Eimeria tenella] 


0.34 


1251 


AF069489 


Homo sapiens cAMP
specific 

phosphodiesterase 4A 
variant pde46 
(PDE4A) gene, exons 
2 through 13 and 
alternative splice
exons 3a, 6a, 6b, and
9a 


2e-24 


728836 


!!!! ALU SUBFAMILY SP
WARNING ENTRY 


1e-05


1252 




Homo sapiens P2X7 
gene exon 4-8


9e-25 


728831


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


1e-05


1253 




Human 28S 
ribosomal RNA gene, 

complete cds


8e-25 


<NONE> 


<NONE> 


<NONE> 


1254 


AB007953 


Homo sapiens 
mRNA, chromosome
1 specific transcript
KIAA0484


8e-25 


<NONE> 


<NONE> 


<NONE> 


1255 


Z60212


H. sapiens CpG DNA, 
clone 195c8, forward 
read cpg195c8.ft1a.


8e-25 


158154 


(M81959) POU domain protein 
[Drosophila melanogaster] 


3.3 


1256 


AF100694


Mus musculus 
Pontin52 mRNA,
complete cds 


7e-25 


<NONE> 


<NONE> 


<NONE> 


1257 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


7e-25 


<NONE> 


<NONE> 


<NONE> 


1258 


Y12851 


Homo sapiens P2X7 
gene, exon 1 and
joined CDS 


2e-25 


<NONE> 


<NONE> 


<NONE> 








Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






Mus musculus Tera 










1259 


U64033 


(Tera) mRNA, 
complete cds 


9e-26 


<NONE> 


<NONE> 


<NONE> 


1260 


U19181


Rattus norvegicus
Rabin3 mRNA,
complete cds. 


9e-26 


624225 


(U19181) Rabin3 [Rattus 
norvegicus] 


1e-13


1261


AF020788 


Caenorhabditis
elegans SEL-10 (sel-
10) mRNA, complete
cds 


- . 

9e-26 


• 

3915881 


SbL- FU PKU 1 blN Candida 
CDC 4 gene (TR;E234056); 
cDNA EST EMBL:D27699 

EST EMBL:D27698 comes 
from this gene; cDNA EST 
iiiviDL.uji/yj comes rrom tnis 
gene; cDNA EST 
EMBL:Dj3271 comes from this 
gen... 


* 

7e-32 


1262 


AB016930


Cricetulus griseus 
mRNA for 

Phosphatidylglycerophosphate synthase,
complete cds 


8e-26 


4159682 


(AB016930)

Phosphatidylglycerophosphate 
synthase [Cricetulus griseus] 


0.045 


1263 


AF100694


Mus musculus 
Pontin52 mRNA,
complete cds 


3e-26 


3878629 


(Z93385) predicted using 
Genefinder; Similarity to 
B.subtilis GTP-binding protein


2e-10 


1264 


X91195 


H.sapiens SOM172
mRNA 


1e-26


<NONE>


<NONE> 


<NONE> 


1265 


AF100694


Mus musculus 
Pontin52 mRNA,
complete cds 


1e-26


1360637 


(X95995) ENBP1 [Vicia sativa]


3.1 


1266 


L08237 


Human MG21 
mRNA, partial cds.


1e-26


950411 


(L08237) located at OATL1
[Homo sapiens]


9e-09 


1267 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-27


3881080


(AL032657) similar to EGF-like 
domain; cDNA EST yk299al2.3 
comes from this gene; cDNA 
EST EMBL:D35398 comes 
from this gene; cDNA EST 
yk331h6.5 comes from this
gene; cDNA EST yk299al2.5 
comes from this gene; cDNA 
EST yk467aS...


0.001


1268 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


8e-27 


1731324 1 


HYPOTHETICAL PROTEIN 
>gi|166306


4.0 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 

















1269 


X89211 


H.sapiens DNA for 
endogenous retroviral 
like element


8e-27 


2065209 


(Y12713) Gag polyprotein [Mus
musculus] 


0.005 


1270 


U73166 


Homo sapiens cosmid 
clone LUCA15 from 
3p21.3, complete
sequence [Homo 
sapiens] 


3e-27 


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 




1271 


D78255 


Mouse mRNA for 
PAP-1. complete cds 


3e-27 


1850098 


(D78255) PAP-1 [Mus 
musculus] 


AC- 1 \.J 


1272 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor]




1273 


AB015202


Homo sapiens gene 
for hippocalcin, exon
2, 3 and complete cds 


1e-27


3877698 


(Z83318) predicted using 
Genefinder; cDNA EST
yk369e7.5 comes from this gene 
[Caenorhabditis elegans] 


0.37 


1274 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-27


3328188 


(AF074902) laminin alpha chain 
[Caenorhabditis elegans]


0.19 


1275 


Z29336 


H.sapiens gene for 

Cu/Zn-superoxide 

dismutase 


1e-27


728831 


!!!! ALU SUBFAMILY J 
WARNING ENTRY 


6e-05 


1276 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-28 


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


9.2 


1277 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


9e-28 


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


0.054 


1278 


AB001636 


Homo sapiens mRNA 
for ATP-dependent 
RNA helicase #46, 
complete cds 


4e-28 


3913425 


PUTATIVE PRE-MRNA

DEPENDENT RNA 
HELICASE >gi|2275203 
(AC002337) RNA helicase 
isolog [Arabidopsis thaliana]


3e-22 


1279 


AF100694


Mus musculus 
Pontin52 mRNA, 
complete cds 


3e-28 


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExtl)
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


0.066
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1280


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


3e-28


4056454


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


4e-05 


1281 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1282 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1283 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1284


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1285 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1286 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


<NONE> 


<NONE> 


<NONE> 


1287 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


140505 


PROBABLE INTRON 
MATURASE liverwort 
(Marchantia polymorpha) 
chloroplast >gi|11663


3.0 


1288 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


140505 


PROBABLE INTRON 
MATURASE liverwort 
(Marchantia polymorpha) 
chloroplast >gi|11663


1.8 


1289 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


2133579 


spermatophorin Sp23 - yellow 
mealworm molitor] 


0.50


1290 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.087


1291 


Z63029


H.sapiens CpG DNA, 
clone 77b3, forward 
read cpg77b3.ft1a.


1e-28


2493240 


HYPOTHETICAL 29.3 KD 
PROTEIN pseudotsugata 
nuclear polyhedrosis virus] 


0.014 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE












1292


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


118588


DEHYDRIN DHN3
>gi|100035|pir||S18139 dehydrin
DHN3 - garden pea >gi|20709 
(X63063) pea dehydrin DHN3 
[Pisum sativum] 


0.010 


1293 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.007 


1294 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


0.002 


1295 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


126363 


LAMININ ALPHA-1 CHAIN
PRECURSOR precursor - 
human 


3e-04 


1296 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


1e-04


1297


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated 
region with similarity to
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


3e-05 


1298 


AF100694 


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


3157926 


(AC002131) Strong similarity to 
extensin-like protein gb|Z34465 
from Zea mays. [Arabidopsis 
thaliana] 


2e-05 


1299 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


1e-05
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1300


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


320919


kinetoplast-associated protein -
Trypanosoma cruzi >gi|162142
(M25364) kinetoplast-associated 
protein 


1e-07


1301 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExtl)
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


9e-08 


1302 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana.
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


1e-09


1303 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


9e-10 


1304 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl)
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


4e-10 


1305 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-28


4056454 


(AC005990) Contains repeated
region with similarity to 
gb|U43627 extensin (atExtl)
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


9e-11


1306 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-28


4056454 


(AC005990) Contains repeated 
region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene.
[Arabidopsis thaliana]


6e-11



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1307


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


4e-29 


<NONE> 


<NONE>




1308 


AF079529 


Homo sapiens cAMP- 
specific 

phosphodiesterase 8B 


4e-29 


<NONE> 




KJLV 


1309 


X93334 


H.sapiens 

mitochondrial DNA, 
complete genome 


4e-29 


116977 


CYTOCHROME C OXIDASE
POLYPEPTIDE I chain I - 
human mitochondrion (SGC1) 

oxidase I [Homo sapiens] 
>gi|506829 (J01415) 
cytochrome oxidase subunit 1 
[Homo sapiens]


3e-09 


1310 


AF020760 


Homo sapiens serine 
protease (Omi) 
mRNA, complete cds 


4e-29 


2738915 


(AF020760) serine protease 
[Homo sapiens] 


8e-12 


1311 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 43 
mRNA, partial cds 


4e-29 


2072294 


(U95097) mitotic
phosphoprotein 43 [Xenopus 
laevis] 


1e-25


1312 


L32162 


Homo sapiens 
transcription factor 
mRNA, 5' end.


2e-29 


2501706 


RENAL TRANSCRIPTION 
FACTOR KID-1 finger protein 
[Mus musculus] 


8e-15 


1313 


AF100694


Mus musculus
Pontin52 mRNA,
complete cds


1e-29


4056454 


(AC005990) Contains repeated

region with similarity to 
gb|U43627 extensin (atExtl) 
gene from Arabidopsis thaliana. 
ESTs gb|Z34165 and gb|Z18788
come from this gene. 
[Arabidopsis thaliana] 


1e-04


1314 


AF100694 


Mus musculus 
Pontin52 mRNA, 
complete cds 


1e-29


1169643


FMRFAMIDE-RELATED
NEUROPEPTIDES
PRECURSOR >gi|416208
(U03137) neuropeptide
precursor FMRFamide-related
peptide [Lymnaea stagnalis]


1e-05


1315 


U50839 


Homo sapiens g16
protein (g16) mRNA,
complete cds


1e-29


3212101 


(AF069517) RNA binding
protein DEF-3 [Homo sapiens] 


6e-10
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



1316 X69711



1317 AF010227



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



H. sapiens mRNA for 
ICAM-R 



Homo sapiens 
receptor- associated 
coactivator 3



5e-30 



5e-30 



DESCRIPTION 



intercellular adhesion molecule 



P VALUE 



299356 



2331250 



3, ICAM-3=lymphocyte
function- associated antigen 1 
counter-receptor homolog 
[human, tonsil. Peptide Partial, 
518 aa]



3e-08 



(AF012108) Amplified in Breast
Cancer [Homo sapiens] 



8e-09 



1318 AF086395



Homo sapiens full 
length insert cDNA 
clone ZD75C01 



2e-30 



3861241 



(AJ235273) CELL SURFACE 
ANTIGEN (scai) 



4.2 



1319 M27830 



Human 28S 
ribosomal RNA gene 
complete cds. 



2e-30 



1730522 



PHOSPHOGLYCERATE 
KINASE 2.7.2.3) - Pyrococcus 
woesei >gi| 1054832 (X73527) 
phosphoglycerate kinase 
[Pyrococcus woesei] 



1320 M79307 



1321 AL022168



Mouse GTP-binding

protein (Rab17)

mRNA sequence. 
Human DNA

sequence from clone 

U247E12 on 

chromosome Xq22- 

23, complete 

sequence [Homo 

sapiens] 



2e-30 



464564 



RAS RELATED PROTEIN 
RAB-17 Rab17 - mouse
(fragment) >gi|297157 
(X70804) rab17 [Mus musculus]



1e-30



2072967 



(U93570) putative pl50 [Homo 
sapiens] 



3.8 



9e-11



3e-11



1322 X85124 



M.musculus pacsin
gene



1e-30



2217964 



(Z50798) p52 [Gallus gallus] | 1e-34



1323 U37408



1324 L04193 



Homo sapiens 
phosphoprotein CtBP 
mRNA. complete cds 



5e-31 



74518 



Human lens 
membrane protein 
(mpl9) gene, exon 
11. 



structural polyprotein -
Venezuelan equine encephalitis 
virus (strain TRD) >gi|323710
(J04332) poly-envelope protein
[Venezuelan equine encephalitis
virus] 



2e-31 



728831



!!!! ALU SUBFAMILY J
WARNING ENTRY 



LI 



7e-07 



1325 M11167



Human 28S 
ribosomal RNA gene. 



6e-32 



<NONE> 



<NONE> 



<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1326 


M33336 


Human cAMP- 
dependent protein
kinase type I-alpha 
subunit (PRKAR1A)
mRNA, complete cds 


2e-32 


<NONE> 


<NONE> 


<NONE> 


1327 


J03060 


Human 

glucocerebrosidase 
pseudogene, complete 
cds 


2e-32 


2144479 


glucosylceramidase (EC 
3.2.1.45) precursor - human


1e-05


1328 


U33053 


Human lipid- 
activated protein 
kinase PRK1 mRNA. 
complete cds 


7e-33 


2137689 


protein kinase (EC 2.7.1.37) - 
mouse 


1e-14


1329 


J04617


Human elongation 
factor EF-1-alpha
gene, complete cds. > 
:: dbj|E02629|E02629 
DNA of human 
polypeptide chain 
elongation factor-1
alpha 


6e-33 


<NONE> 


<NONE> 


<NONE> 


1330 


L40396 


Homo sapiens (clone 

s22i71) mRNA
fragment


6e-33 


124235 


INTERMEDIATE FILAMENT 
PROTEIN B protein B - 
common roundworm 


1.00 


1331 


Z72813 


S.cerevisiae 
chromosome VII 
reading frame ORF 
YGR028w


6e-33 


1709135 


MSP1 PROTEIN HOMOLOG 
Yeast MSP1 protein (TAT-
binding homolog 4)


8e-50 


1332 


AB007941 


Homo sapiens mRNA 
for KIAA0472 
protein, partial cds 


2e-33 


1150834


(U42471) Wiscott-Aldrich 
Syndrome protein homolog 
[Mus musculus] 


2.0 


1333 


AF044574 


Rattus norvegicus 
putative peroxisomal 
2,4-dienoyl-CoA 
reductase (DCR- 
AKL) mRNA, 
complete cds 


2e-34 


4105269 


(AF044574) putative 
peroxisomal 2,4-dienoyl-CoA 
reductase [Rattus norvegicus] 


6e-15 


1334 


D14657


Human mRNA for 
KIAA0101 gene,
complete cds 


7e-35 


<NONE> 


<NONE> 


<NONE> 


1335 


X69910 


H.sapiens p63 mRNA 
for transmembrane 
protein


7e-35 


2136323 


trithorax homolog HTX - human 
(fragment) homolog=MLL 
{alternative splicing, clone I4p- 
I8B1 


0.94 
SEQ
ID 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



1336 AF053455



1337 X58374 



1338 AF086492



1339 Z96223 



1340 Z37986 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



Homo sapiens 



tetraspan TM4SF 
(TSPAN-5) gene, 
complete cds 



D.melanogaster cm 
mRNA 



Homo sapiens full 
length insert cDNA
clone ZD95DI1 

H.sapiens telomeric 
DNA sequence, clone 
12PTEL120, read 
12PTELOO120.seq 



DESCRIPTION 



7e-35 



3152703 



3e-35 



117478 



9e-36 



2909809 



P VALUE 



(AF065389) tetraspan NET-4 
[Homo sapiens]



(AF031328) aminoglycoside 6'- 
N-acetyltransferase It



3e-36 



2408068 



H.sapiens mRNA for
phenylalkylamine
binding protein. 



1341 U57847 



1342 Y15054



1343 AF084205 



1344 X78604 



Human ribosomal
protein S27 mRNA, 
complete cds. end 
similar to similar to
metallopanstimulin 1 
> ;: 

gb|AA316327|AA316
327 EST188061 HCC
cell line (matastasis to
liver in mouse) II
Homo sapiens cDNA 
5' end similar to 
similar to 

metallopanstimulin 1 



Rattus norvegicus 
mRNA for 70 kDa 
tumor specific 
antigen, partial 



Rattus norvegicus 
serine/threonine 
protein kinase TAO1
mRNA. complete cds 



R. norvegicus 
(Sprague Dawley) 
ARL5 mRNA for 
ARF-like protein 5



1e-36



1362793 



3e-37 



1171014 



emopamil-binding protein 
human >gi|780263



40S RIBOSOMAL PROTEIN 
S27 growth factor-inducible zinc
finger protein MPS-1 - human 
>gi|431319 (L19739)
metallopanstimulin [Homo 
sapiens] >gi|1373421 (U57847)
ribosomal protein S27 



3e-37 



3123027 



3e-37 



3452473 



1e-37



<NONE> 



70 KD WD-REPEAT TUMOR-
SPECIFIC ANTIGEN
>gi|2505957|gnl|PID|e353992
(Y15054) 70 kD tumor-specific
antigen [Rattus norvegicus] 



1e-25



CROOKED NECK PROTEIN 6e-41



(AF084205) serine/threonine
protein kinase TAO1 [Rattus
norvegicus]



1.9 



(Z99165) hypothetical protein 0.61 



5e-11



1.4 



2e-15 



5e 4 



<NONE> 



<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1345 


AJ236644 


Homo sapiens 
chromosome 22 CpG 
island DNA, genomic 
Msel fragment, clone 

complete read 


1e-37


2239219 


(Z97210) hypothetical protein


6e-05 


1346 


U09367 


Human zinc finger 
protein ZNF136


4e-39 


2137269 


DNA-binding protein - mouse 
>gi|437444


7e-23 


1347 


Z69649 


Human DNA 
sequence from 
cosmid L69F7B 
Huntington's Disease 
Region, chromosome 
4p16.3 contains
Huntington Disease
(HD) gene. 


3e-39 



3096918 


(AL023094) putative cyclase 
[Arabidopsis thaliana] 


5.6 


1348 


AF065389


Homo sapiens 
tetraspan NET-4
mRNA, complete cds






(AF065389) tetraspan NET-4 

[Homo sapiens]


6e-°9 


1349 


AF038172 


Homo sapiens clone 
23923 mRNA 
sequence 


1e-40


1813464 


(U60883) CapC [Bacillus 
firmus] 


2.8 


1350 


Z83095 


H.sapiens Fanconi

anaemia group A 
gene, exons 39, 40, 
41, 42 and 43


1e-40


2137870 


zinc finger protein - mouse 
(fragment) 


3e-23 


1351 


AF057734 


Homo sapiens 17-
beta-hydroxysteroid
dehydrogenase IV

(HSD17B4) gene, 
exon 16 


1e-40


2842416 


(AL008730) dJ487J7 LI 
(putative protein dJ487J7.1
isoform 1) [Homo sapiens] 


6e-61 


1352 


AF070567 


Homo sapiens clone 
24544 beta- 
dystrobrevin mRNA, 
partial cds 


4e-41 


3133087 


(Y15718) dystrobrevin B DTN-
B2 [Homo sapiens]


7e-13 


1353 


AF006088 


Homo sapiens Arp2/3 
protein complex 
subunit pl6-Arc 
(ARC16) mRNA,
complete cds 


2e-41


3121767 


ARP2/3 COMPLEX 16KD 
SUBUNIT 


3e-36


1354 


X69942 


M.musculus mRNA 
of enhancer-trap- 
locus 1 


6e-42 


2291152 


(AF016418) No definition line
found [Caenorhabditis elegans]


6.4 



Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1355 


X87838 


H.sapiens mRNA for 
beta-catenin 


5e-42 


1373019 


(U28811) cysteine-rich
fibroblast growth factor receptor 


8e-05 



1356 


AB018268 


Homo sapiens mRNA 
for KIAA0725 
protein, partial cds 


5e-42 


3882171 


(AB018268) KIAA0725 protein
[Homo sapiens] 


2e-33 


1357 


M84424


Human cathepsin E 
(CTSE) gene, exon 9 
and complete cds. 


2e-42 


<NONE> 


<NONE> 


<NONE> 


1358 


U80776 


Human EST clone 
NIB1543 mariner
transposon Hsmar1
orf gene, complete 
cds 


2e-42 


2231380 


(U80776) orf; encodes putative
chimeric protein with SET 
domain in N-terminus with 
similarity to several other 
human, Drosophila, nematode 
and yeast proteins [Homo 
sapiens] 


3e-11


1359 


U55184 


Human G protein 
Golf alpha gene, exon 
12 and complete cds 


2e-42 


3165531 


(AF067608) No definition line 
found [Caenorhabditis elegans] 


1e-16


1360 


AC005190 


Homo sapiens PAC 
clone DJ1152D16 
from Xq23, complete 
sequence [Homo 
sapiens] 


6e-43 


2978255 


(AB007407) myeloid zinc finger 
protein-2 [Mus musculus] 


2.3 


1361 


AB018284


Homo sapiens mRNA 
for KIAA0741
protein, complete cds


5e-43 


<NONE> 


<NONE> 


<NONE> 


1362 


AB011137 


Homo sapiens mRNA
for KIAA0565 
protein, complete cds 


5e-43 


3043654 


(AB011137) KIAA0565 protein
[Homo sapiens] 


1e-07


1363


M93651


Human set gene, 
complete cds. 


2e-43 


<NONE> 


<NONE> 


<NONE> 


1364 


Z47087 


H.sapiens mRNA for
RNA polymerase II 
elongation factor-like 
protein. 


2e-43 


1872514 


(U84404) E6-associated protein
E6-AP/ubiquitin-protein ligase
[Homo sapiens] >gi|2361031
(AF01670S) E6-AP ubiquitin- 
protein ligase [Homo sapiens] 


7.2 


1365 


U27197 


Drosophila 
melanogaster pelota 
(pelo) mRNA. 
complete cds 


2e-43 


1352736 


PELOTA PROTEIN >gi|973224 
(U27197) pelota [Drosophila
melanogaster] 


1e-46
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 












1366


D80007


Human mRNA for
KIAA0185 gene,
partial cds


6e-44


2498864


RRP5 PROTEIN HOMOLOG
(KIAA0185) hypothetical
protein YM9959.11C of
S.cerevisiae. [Homo sapiens]


6e-09 


1367 


AF005039 


Homo sapiens 
secretory carrier 
membrane protein 
(SCAMP3) mRNA, 
complete cds 


6e-44 


2232243 


(AF005039) secretory carrier 
membrane protein [Homo 
sapiens] 


2e-09 


1368 


X68101


R.norvegicus trg 
mRNA 


2e-44 


550420 


(X68101) trg gene product
[Rattus norvegicus] 


1e-37


1369 


AF044206 


Homo sapiens 
cyclooxygenase 
(COX-2) gene, 
promoter and exon 1 


2e-45 


2072953 


(U93565) putative p150 [Homo
sapiens]


5e-06 


1370 


L48708 


Homo sapiens 
faciogenital dysplasia 
(FGD1) gene, 5' end
of intron 17 


8e-46 


<NONE> 


<NONE> 


<NONE> 


1371 


X15822


Human COX VIIa-L 
mRNA for liver- 
specific cytochrome c 
oxidase (EC 1.9.3.1.) 


3e-46 


117121 


CYTOCHROME C OXIDASE
POLYPEPTIDE VIIA-LIVER
PRECURSOR
>gi|2144370|pir||OSHU7L
cytochrome-c oxidase (EC
1.9.3.1) chain VIIa precursor,
hepatic - human >gi|30147
(X15822) precursor (AA -23 to
60) [Homo sapiens] 


5e-13


1372 


U47323 


Mus musculus 
stromal cell protein 
mRNA, complete cds 


3e-46 


1493833 


(U47323) stromal cell protein 
[Mus musculus] 


1e-48


1373 


AF059524 


Homo sapiens 
reticulon gene family 
protein 


7e-47 


1731169 


HYPOTHETICAL 113.1 KD
PROTEIN T28D9.7 IN 
CHROMOSOME II >gi|861264 
(U28738) coded for by C.
elegans cDNA yk8h5.3; coded
for by C. elegans cDNA
yk8h5.5; similar to C. elegans
des-1 and mec-4 in exon 2
[Caenorhabditis elegans]


7.8


1374 


AJ132583


Homo sapiens mRNA 
for puromycin 
sensitive 
aminopeptidase, 
partial 


3e-47 


1777519 


(U39123) T cell receptor beta 
chain [Homo sapiens]


9.7 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1375 


M97856 


Homo sapiens histone 
binding protein 
mRNA, complete cds. 


3e-47 


2645327 


(U83821) NADH
dehydrogenase subunit 3 
[Oryzomys palustris] 


5.7 



1376 


U53220 


Human 

retinoblastoma- 
related Rb2/p130
gene, 5' flanking 
region and partial cds 


3e-47 


2499225 


CMP-SIALIC ACID
TRANSPORTER CMP-sialic 
acid transporter [Cricetulus 
griseus] 


5.3 


1377 


X87870 


H.sapiens mRNA for 
hepatocyte nuclear
factor 4a 


1e-47


728832 


'ft at Tl QTTlPAIUn Y 

.... r\JL> U OuD i rviVl I jO 

WARNING ENTRY 


7.3 


1378 


AF060195


Mus musculus 
proteasome regulator 
PA28 beta subunit 
gene, complete cds 


3e-48 


478681 


limb deformity protein - chicken 


0.25 


1379 


AB018285


Homo sapiens mRNA 
for KIAA0742 
protein, partial cds 


1e-48


3122969 


TESTIS SPECIFIC PROTEIN
A (ZINC FINGER PROTEIN
TSGA) >gi|281040|pir||S28499
probable zinc finger protein - rat
>gi|57504 (X59993) zinc finger
protein 


1e-30


1380 


U35032 


Human endogenous 
retrovirus clone 
c5.ll,HERV-H 
multiply spliced 
subgenomic leader,
protease and integrase

region mRNA, partial 
cds 


4e-49 


88558 


retroviral proteinase-like protein
- human 


6e-05 


1381 


AB007956 


Homo sapiens 
mRNA, chromosome 
1 specific transcript 
KIAA0487


1e-49


<NONE> 


<NONE>


<NONE>


1382 


D86987 


Homo sapiens mRNA 
for KIAA0214 
protein, complete cds 


1e-49


2497944 


ALPHA SCRUIN >gi|633238 
(Z38132) scruin [Limulus 
polyphemus] 

>gi|1093326|prf||2103269A 
scruin [Limulus sp.]


9.7 


1383 


U25826 


Human transcription 

factor (SC1) gene,
complete cds. 


4e-50 


<NONE> 


<NONE> 


<NONE> 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins)


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1384


U46690


Mus musculus ATP-
dependent RNA
helicase mRNA, 
partial cds. 


4e-50 


1335873 


(U46690) ATP-dependent RNA 
helicase [Mus musculus]


3e-24 


1385 


AF072128 


Mus musculus 
claudin-2 mRNA, 
complete cds 


2e-50 


3335184 


(AF072128) claudin-2 [Mus 
musculus] 


4e-24


1386


AF093593 


Homo sapiens 
snRNA activating 
protein complex 
19kDa subunit 
(SNAP 19) mRNA, 
complete cds 


1e-50


3668416 


(AF093593) snRNA activating 
protein complex 19kDa subunit
[Homo sapiens] 


0.003 


1387 


U79745 


Homo sapiens 
monocarboxylate 

transporter 
homologue MCT6 
mRNA, complete cds 


1e-50


1177607 


(X92485) pva1 [Plasmodium
vivax]


2e-07 


1388 


L09647 


Rattus norvegicus 
hepatocyte nuclear 
factor 3a 


1e-50


404764 


(L10409) fork head related 
protein [Mus musculus] 


2e-21 


1389 


X61506 


Mouse E46 mRNA 
for E46 protein 


4e-51 


114909


BRAIN PROTEIN E46 


1e-20


1390 


M33387 


Human debrisoquine 
4-hydroxylase 
(CYP2D8P) and 


1e-51


126296 


LINE-1 REVERSE
TRANSCRIPTASE 
HOMOLOG protein 
[Nycticebus coucang] 


5e-15 


1391 


AF019767


Homo sapiens zinc 
finger protein (ZPR1)
mRNA, complete cds 


4e-52


VO 1 JU/ 


\DOj iqo) ancnor proicin. n_ivi 




1392 


Z37986 


H. sapiens mRNA for 
phenylalkylamine 
binding protein. 


2e-52 


<NONE> 


<NONE> 


<NONE> 


1393 


U65416 


Human MHC class I 
molecule (MICB) 
gene, complete cds


2e-52 


3878637


(Z49128) weak similarity with
SINR protein (Swiss Prot 
accession number P06533); 
cDNA EST EMBL:T00631
comes from this gene; cDNA 
EST yk293dl0.5 comes from 
this gene [Caenorhabditis 
elegans]


8.7 
SEQ
ID 



Nearest Neighbor ( BlastN vs. Genbank) 



ACCESSION 



DESCRIPTION 



P VALUE 



1394 Z57647



H.sapiens CpG DNA, 
clone 189a6. forward 
read cp.gl39a6.ftla . 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)



ACCESSION 



DESCRIPTION 



beta-globin DNA-binding



P VALUE 



2e-52 



1395 L13738



Human activated 
p21cdc42Hs kinase
(ack) mRNA. 
complete cds. 



1 i 1 187 



2e-52 



2921447 



1396 



Homo sapiens spindle
pole body protein
spc97 homolog GCP2
AF042379 mRNA, complete cds



1397 



1398 



1399 S60754 



1400 D86972 



{ VNTR locus DX24, 
hypervariable tandem 
repeat cluster} 
[human, Genomic, 
2991 nt] > :: 
gb|L07935|HUMVNT 
RA Homo sapiens 
microsatellite VNTR 
DNA sequence. 
Human mRNA for
KIAA0218 gene,
complete cds



7e-53 



2801701 



Homo sapiens RNA 
polymerase I 40kD
subunit mRNA,

AF047441 complete cds | 6e-53

Homo sapiens cell 
cycle protein 
(PA2G4) gene, exons 
6 through 13, and
AF104670 complete cds | 2e-53



3914807 



<NONE> 



2e-53 



1e-53



1209669 



3426041 



protein Bl, transcription factor 
PU.1 - mouse >gi|200586
(M32370) PU.1 protein [Mus
musculus] >gi|200972
(M38252) transcription factor
Pu.1 [Mus musculus]



5.8 



(AF037260) non-receptor 
protein tyrosine kinase Ack 
[Mus musculus]



7e-23 



(AF042379) spindle pole body 

protein spc97 homolog GCP2 | 1e-16
DNA-DIRECTED RNA

POLYMERASE I 40 KD
POLYPEPTIDE (RPA40) 
(RPA39) >gi|2266929 
(AF008442) RNA polymerase I 
subunit hRPA39 [Homo 
sapiens] | 4e-10



<NONE> 



<NONE> 



(U38810)CAGR1 [Homo 

sapiens] >gi|3098420 

(AF040945) homeotic regulator

homolog MAB2I [Mus 

musculus] 



(AC005168) unknown protein 
[Arabidopsis thaliana] 



4.6 



9.1 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 
















1401 


AJ236682 


chromosome 22 CpG 
island DNA, genomic
Msel fragment, clone
22CGIB49E6 , 
complete read 


7e-54 


3928721


(AL034355) putative
cytochrome oxidase subunit I
[Streptomyces coelicolor]


0.30 


1402 




Homo sapiens 
chromosome 22 CpG 
island DNA, genomic
Msel fragment, clone 
22CGIB49E6 , 
complete read 


6e-54 


3928721 


(AL034355) putative 
cytochrome oxidase subunit I 
[Streptomyces coelicolor]


0.28 


1403 


M37583 


Human histone 
(H2A.Z) mRNA, 
complete cds. 


6e-54 


70711 


histone H2A.F, embryonic - 
chicken 


2e-16 


1404 


AJ009947


Homo sapiens mRNA 
for putative ATPase, 
partial 


6e-54 


3500295


(AJ009947) putative ATPase 
[Homo sapiens] 


3e-18


1405 


Y08459 


B.taurus mRNA for 
novel cytoplasmic 
protein 


2e-54 


<NONE>


<NONE> 


<NONE> 


1406 


AF042384 


Homo sapiens BC-2 
protein mRNA, 
complete cds 


2e-54 


2828147 


(AF042384) BC-2 protein 
[Homo sapiens]


2e-14


1407 


AF042379 


Homo sapiens spindle 
pole body protein
spc97 homolog GCP2 
mRNA, complete cds 


8e-55


2801701 


(AF042379) spindle pole body 
protein spc97 homolog GCP2 


2e-17 


1408 


AF005355 


Oryctolagus 
cuniculus translation 
initiation factor 
eIF2C mRNA, 
complete cds 


7e-55 


3253159 


(AF005355) translation 
initiation factor eIF2C 


3e-53 


1409 


AF008442 


Homo sapiens RNA 
polymerase I subunit 
hRPA39 mRNA, 
complete cds 


3e-55 


3335138 


(AF047441) RNA polymerase I 
40kD subunit [Homo sapiens] 


3e-20 


1410 


AF047441 


Homo sapiens RNA
polymerase I 40kD
subunit mRNA, 
complete cds 


3e-55 


3335138 


(AF047441) RNA polymerase I
40kD subunit [Homo sapiens] 


3e-20 
Nearest Neighbor (BlastN vs. Genbank)


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






1411


X08004


Human mRNA for
Rap1B protein > ::
emb|A08693|A08693
H.sapiens rap1b
cDNA


2e-55 


539995 


transforming protein rap1b - rat
(strain Copenhagen) 


2e-18


1412 


AF010403


Homo sapiens ALR 
mRNA. complete cds 


2e-55 


2358285 


(AF010403) ALR [Homo
sapiens]


1e-49


1413 



M77016 


Human tropomodulin
mRNA, complete cds. 


8e-56 


262249 


(S52010) orf1 5' of EpoR [mice,
Peptide. 85 aa] [Mus sp.] 


0.027 


1414 


AB020633 


Homo sapiens mRNA 
for KIAA0826 
protein, partial cds 


2e-56 


<NONE> 


<NONE> 


<NONE> 


1415 


X87489 


H.sapiens genomic 
DNA (chromosome 
3;cloneNL1243D) 


2e-56 


1814029 


(U84501) cuticle collagen 
[Caenorhabduis briggsae) 


0.038 


1416 


AB0O7893 


Homo sapiens 
KIAA0433 mRNA, 
partial cds 


2e-56 


2887437 


(AB007893) KIAA0433 [Homo 
sapiens] 


9e-21 


1417 


X78925 


H.sapiens HZF2 
mRNA for zinc finger 
protei n 


le-56 


3342002 


(AF054180) hematopoietic cell 
derived zinc finger protein 
(Homo sapiens] 


2e-21 


1418 


Z5628I 


H.sapiens mRNA for 
interferon regulatory 
factor 3 


9e-57 


2497442 


INTERFERON 
REGULATORY FACTOR 3 
factor 3 [Homo sapiens] 


2e-21 


1419 


U78772 


Homo sapiens nuclear 
VCP-Iike protein 
NVLp.l 


8e-57 


2406565 


(U68140) nuclear VCP-Iike 
protein NVLp.2 [Homo sapiens] 


5e-20 


1420 


D79994 


Human mRNA for 
KIAA0172 gene, 
partial cds 


3e-57 


t 1 36404 


(D79994) similar to ankyrin of 
Chromatium vinosum. [Homo 
sapiens] 


9e-3S 


1421 


AB0O2342 


Human mRNA for 
KIAA0344 gene, 
complete cds 


Ie-57 


2224629 


(AB002342) KIAA0344 [Homo 
sapiens] 


4e-20 


1422 


L 19437 


Human iransaldolase 
mRNA containing 
transposable element, 
complete cds 


le-57 


1553119 


(U63159) iransaldolase [Mus 
musculus] 


2e-20 


1423 


D17532 


Human mRNA for 
RCK. complete cds 


9e-5S 


129376 


PROBABLE ATP- 
DEPENDENT RNA 
HELICASE P54 (ONCOGENE 
RCK) (DEAD BOX PROTEIN 
6) 


le-10 
1424 | X79568 | H.sapiens BDP1 mRNA for protein-tyrosine-phosphatase | 9e-58 | 1871531 | (X79568) protein-tyrosine-phosphatase | 1e-22
1425 | X79568 | H.sapiens BDP1 mRNA for protein-tyrosine-phosphatase | 9e-58 | 1871531 | (X79568) protein-tyrosine-phosphatase | 9e-23
1426 | AB012295 | Homo sapiens HKE1.5 mRNA for GDS-related protein, complete cds | 7e-58 | 2648021 | (Z97184) RGL2 [Homo sapiens] | 9e-19
1427 | AF086040 | Homo sapiens full length insert cDNA clone YX52E07 | 1e-58 | 543222 | glutamine (Q)-rich factor 1, QRF-1 - mouse factor 1, QRF-1 [mice, B-cell leukemia, BCL1, Peptide Partial, 84 aa] | 3e-36
1428 | AB018195 | Homo sapiens ca xi mRNA for carbonic anhydrase-related protein XI, complete cds | 4e-59 | <NONE> | <NONE> | <NONE>
1429 | AF071777 | Mus musculus IRE1 (Ire1) mRNA, complete cds | 4e-59 | 3766209 | (AF071777) IRE1 [Mus musculus] | 7e-28
1430 | AB000462 | Homo sapiens mRNA for SH3 binding protein, complete cds, clone:RES4-23A | 3e-59 | <NONE> | <NONE> | <NONE>
1431 | AF038172 | Homo sapiens clone 23923 mRNA sequence | 3e-59 | 3758855 | (Z98551) MAL3P6.11 [Plasmodium falciparum] | 1.3
1432 | Z84812 | Human DNA sequence from phage pTEL from a contig from the tip of the short arm of chromosome 16, spanning 2Mb of 16p13.3 Contains ESTs | 1e-59 | 400927 | RIBONUCLEOPROTEIN RB97D ribonucleoprotein [Drosophila melanogaster] | 2.5
1433 | U36484 | Human laminin-binding protein gene, partial cds, and E2 small nucleolar RNA gene, complete sequence | 1e-59 | 226005 | protein 40kD [Mus musculus] | 7e-05
1434 | L11285 | Homo sapiens ERK activator kinase (MEK2) mRNA. | 1e-59 | 2499630 | DUAL SPECIFICITY MITOGEN-ACTIVATED PROTEIN KINASE KINASE 2 (MAP KINASE KINASE 2) (MAPKK 2) kinase type 2 [Gallus gallus] | 3e-21
1435 | AF086555 | Homo sapiens full length insert cDNA clone ZE14E04 | 4e-60 | 3287674 | (AC005239) F23149.1 [Homo sapiens] | 2e-04
1436 | M24766 | Human (clone DHAIV2-12) alpha-2 collagen type IV | 4e-60 | 29551 | (X05610) alpha (2) chain [Homo sapiens] | 6e-15
1437 | X65550 | H.sapiens mki67a mRNA (long type) for antigen of monoclonal antibody | 4e-60 | 1170654 | ANTIGEN KI-67 >gi|539555|pir||A48666 cell proliferation antigen Ki-67, long form - human Ki-67 [Homo | 3e-15
1438 | M27319 | Human calmodulin mRNA, complete cds. | 4e-60 | 1345451 | (X05949) Calmodulin (AA 2 - 59) (449 is 1st base in codon) [Drosophila melanogaster] | 7e-20
1439 | Y12781 | Homo sapiens mRNA for transducin (beta) like 1 protein | 3e-60 | 62133 | (X06172) put. 134 kD protein (AA 1 - 1187) put. replicase | 7.4
1440 | AB002383 | Human mRNA for KIAA0385 gene, complete cds | 1e-60 | 1001548 | (D64000) hypothetical protein | 4.4
1441 | AF070614 | Homo sapiens clone 24732 unknown mRNA, partial cds | 2e-61 | 3283879 | (AF070614) unknown [Homo sapiens] | 3e-17
1442 | AB002326 | Human mRNA for KIAA0328 gene, partial cds | 6e-62 | 547891 | MICROTUBULE-ASSOCIATED PROTEIN 4 microtubule-associated protein-U [Bos taurus] | 5.6
1443 | AF086471 | Homo sapiens full length insert cDNA clone ZD88A01 | 5e-62 | <NONE> | <NONE> | <NONE>
1444 | AB002311 | Human mRNA for KIAA0313 gene, complete cds | 2e-62 | 2506357 | 2,3-DIHYDROXYPHENYLPROPIONATE 1,2-DIOXYGENASE >gi|1657544 (U73857) similar to mcpI gene (catechol 2,3-dioxygenase) of A. eutrophus 3-(2,3-dihydroxyphenylpropionate) 1,2-dioxygenase 2,3-dihydroxyphenylpropionate 1,2-dioxygenase | 3.4
1445 | AF069737 | Xenopus laevis notchless (nle) mRNA, complete cds | 2e-62 | 3687833 | (AF069737) notchless [Xenopus laevis] | 1e-55
1446 | AF044209 | Homo sapiens nuclear receptor co-repressor N-CoR mRNA, complete cds | 5e-63 | 2137603 | nuclear receptor co-repressor N-CoR - mouse musculus] >gi|1583865|prf||2121436A thyroid hormone receptor co-repressor [Mus musculus] | 2e-47
1447 | M69238 | Human aryl hydrocarbon receptor nuclear translocator (ARNT) mRNA, complete cds. | 2e-63 | 2702319 | (AF001307) aryl hydrocarbon receptor nuclear translocator; Arnt [Homo sapiens] | 5e-19
1448 | X80497 | H.sapiens PHKLA mRNA | 2e-63 | 1170685 | PHOSPHORYLASE B KINASE ALPHA REGULATORY CHAIN, LIVER ISOFORM (PHOSPHORYLASE KINASE ALPHA L SUBUNIT) >gi|663010 (X80497) phosphorylase kinase phosphorylase kinase alpha subunit [Homo sapiens] | 5e-22
1449 | AF031141 | Homo sapiens ubiquitin conjugating enzyme | 2e-63 | 2623260 | (AF031141) ubiquitin conjugating enzyme [Homo sapiens] | 1e-23
1450 | Z37166 | H.sapiens BAT1 mRNA for nuclear RNA helicase | 6e-64 | 2500529 | PROBABLE ATP-DEPENDENT RNA HELICASE P47 >gi|2135840|pir||I37201 nuclear RNA helicase (DEAD family) BAT1 - human >gi|587146 (Z37166) nuclear RNA helicase (DEAD family) [Homo sapiens] | 9e-24
1451 | M64240 | Human helix-loop-helix zipper protein (max) mRNA, complete cds. > :: gb|I41138|I41138 Sequence 1 from patent US 5624818 > :: gb|I77062|I77062 Sequence 1 from patent US 5693487 | 5e-64 | 88175 | Myc-binding factor Max, short form - human | 8e-22
1452 | [illegible] | Homo sapiens lysyl hydroxylase (partial clone 2.2 Kb LH) mRNA, complete [illegible] | 2e-64 | 400205 | PROCOLLAGEN-LYSINE,2-OXOGLUTARATE 5-DIOXYGENASE PRECURSOR (LYSYL HYDROXYLASE) lysyl hydroxylase [Homo sapiens] | 7e-22
1453 | U09550 | Human oviductal glycoprotein mRNA, complete cds. | 8e-65 | 2493676 | OVIDUCT-SPECIFIC GLYCOPROTEIN PRECURSOR (OVIDUCTAL GLYCOPROTEIN) (OVIDUCTIN) | 2e-11
1454 | X67877 | R.norvegicus mRNA for cytosolic resiniferatoxin binding protein | 7e-65 | 423664 | resiniferatoxin-binding protein RBP-26, cytosolic - rat >gi|311660 (X67877) cytosolic resiniferatoxin binding protein RBP-26 [Rattus norvegicus] >gi|1093373|prf||2103310A resiniferatoxin-binding protein [Rattus norvegicus] | 2e-40
1455 | AB018254 | Homo sapiens mRNA for KIAA0711 protein, complete cds | 6e-65 | 92298 | glutamine/glutamic acid-rich protein | 0.98
1456 | J03607 | Human 40-kDa keratin intermediate filament precursor gene. | 3e-65 | 1070608 | keratin 19, type I, cytoskeletal - human sapiens] | 4e-07
1457 | U65896 | Human gamma-glutamyl carboxylase gene, complete cds | 2e-65 | <NONE> | <NONE> | <NONE>
1458 | U07681 | Human NAD(H)-specific isocitrate dehydrogenase alpha subunit precursor mRNA, complete cds. | 2e-65 | 1708399 | ISOCITRATE DEHYDROGENASE (NAD), MITOCHONDRIAL SUBUNIT ALPHA PRECURSOR (ISOCITRIC DEHYDROGENASE) (NAD+-SPECIFIC ICDH) dehydrogenase alpha chain precursor - human >gi|706839 subunit precursor [Homo sapiens] | 4e-26
1459 | U88080 | Human zinc finger protein (LD5-1) gene, exons 4, 5 and 6, and complete cds | 2e-65 | 1373394 | (U57796) zinc finger protein [Homo sapiens] >gi|2306773 | 2e-39
1460 | M96625 | Gallus domesticus tensin mRNA sequence. | 3e-66 | 2134419 | tensin - chicken (fragment) >gi|63805 (218529) tensin [Gallus gallus] >gi|212755 (L06662) tensin [Gallus gallus] | 1e-51
1461 | U13262 | Mus musculus myelin gene expression factor (MEF-2) mRNA, partial cds. | 1e-70 | 536926 | (U13262) myelin gene expression factor [Mus musculus] | 9e-42
1462 | U64033 | Mus musculus Tera (Tera) mRNA, complete cds | 5e-72 | 1575505 | (U64033) Tera [Mus musculus] | 9e-34
1463 | X78989 | M.musculus mRNA for testin | 6e-74 | 1351218 | TESTIN 2 (TES2) [CONTAINS: TESTIN 1 | 8e-31
1464 | U64033 | Mus musculus Tera (Tera) mRNA, complete cds | 2e-74 | 1575505 | (U64033) Tera [Mus musculus] | 5e-37
1465 | AF057365 | Canis familiaris UDP N-acetylglucosamine transporter mRNA, complete cds | 9e-79 | 3298605 | (AF057365) UDP N-acetylglucosamine transporter [Canis familiaris] | 9e-10
1466 | AJ006064 | Rattus norvegicus mRNA for coronin-like protein | 1e-82 | 3757680 | (AJ006064) coronin-like protein [Rattus norvegicus] | 3e-62
1467 | U91582 | Macaca fascicularis UDP-glucuronosyltransferase mRNA, complete cds | 4e-89 | 140396 | KARYOGAMY PROTEIN KAR4 yeast (Saccharomyces cerevisiae) | 1e-08
1468 | X06762 | Mouse Hox2.3 mRNA | 3e-92 | 123255 | HOMEOBOX PROTEIN HOX-B7 (HOX-2C) | 9e-23
1469 | AB016930 | Cricetulus griseus mRNA for Phosphatidylglycerophosphate synthase, complete cds | 5e-94 | 4159682 | (AB016930) Phosphatidylglycerophosphate synthase [Cricetulus griseus] | 7e-34
1470 | X74504 | M.musculus T10 mRNA | 7e-97 | 1711658 | SER/THR-RICH PROTEIN T10 IN DGCR REGION >gi|480900|pir||S37488 gene T10 protein - mouse | 3e-59
1471 | U13175 | Rattus norvegicus clone ubc10a ubiquitin conjugating enzyme (E217kB) mRNA, complete cds. | 3e-98 | 1351345 | UBIQUITIN-CONJUGATING ENZYME E2-17 KD 3 (UBIQUITIN-PROTEIN LIGASE) (UBIQUITIN CARRIER PROTEIN) (E2(17)KB 3) >gi|1085588|pir||S53358 ubiquitin conjugating enzyme (E217kB) - rat >gi|595666 (U13175) ubiquitin conjugating enzyme [Rattus norvegicus] norvegicus] >gi|1145691 (U39318) UbcH5C [Homo sapiens] | 5e-05
1472 | [illegible] | h-lamp-2=lysosome-associated membrane protein-2 protein-2b (LAMP2) mRNA, alternatively spliced form h-lamp-2b, [illegible] | e-119 | <NONE> | <NONE> | <NONE>
1473 | D13623 | Rat mRNA for p34 protein, complete cds | e-112 | 480379 | ribosome-binding protein p34 - rat sp.] | 2e-05
1474 | AB013357 | Mus musculus mRNA for 49 kDa zinc finger protein, complete cds | e-136 | 4153886 | (AB013357) 49 kDa zinc finger protein | 5e-08
1475 | AB016930 | Cricetulus griseus mRNA for Phosphatidylglycerophosphate synthase, complete cds | e-117 | 4159682 | (AB016930) Phosphatidylglycerophosphate synthase [Cricetulus griseus] | 4e-32
1476 | U38253 | Rattus norvegicus initiation factor eIF-2B gamma subunit (eIF-2B gamma) mRNA, complete cds | e-103 | 2494312 | TRANSLATION INITIATION FACTOR EIF-2B GAMMA SUBUNIT (EIF-2B GDP-GTP EXCHANGE FACTOR) subunit [Rattus norvegicus] | 3e-42
1477 | X73683 | R.norvegicus mRNA for histone H3.3 | e-117 | 122075 | [illegible] (H3.3Q) histone H3.3 - fruit fly (Drosophila melanogaster) histone H3.3B - chicken >gi|2119023|pir||S61218 histone H3.3 - fruit fly (Drosophila hydei) 1-136) [Oryctolagus cuniculus] >gi|8046 (X53822) Histone H3.3Q gene product [Drosophila melanogaster] >gi|51198 gallus] >gi|161190 (M17876) histone H3 [Spisula solidissima] >gi|211853 (M11393) histone 3.3 [Gallus gallus] >gi|306848 (M11354) H3.3 histone [Homo sapiens] melanogaster] >gi|963031 (X81205) histone H3.3 H3.3A variant [Drosophila melanogaster] musculus] | 1e-45
1478 | U32498 | Rattus norvegicus rsec8 mRNA, partial cds | e-108 | 2143962 | rsec8 - rat (fragment) >gi|1019441 (U32498) rsec8 [Rattus norvegicus] | 7e-48
1479 | U41736 | Mus musculus ancient ubiquitous 46 kDa protein AUP1 precursor (Aup1) mRNA, complete cds | e-146 | 1517822 | (U41736) ancient ubiquitous 46 kDa protein AUP46 precursor [Mus musculus] | 5e-49
1480 | AF041338 | Bos taurus vacuolar proton pump subunit SFD alpha isoform (SFD) mRNA, complete cds | e-119 | 2895578 | (AF041338) vacuolar proton pump subunit SFD alpha isoform [Bos taurus] | 3e-49
1481 | AF064553 | Mus musculus NSD1 protein mRNA, complete cds | e-121 | 3329465 | (AF064553) NSD1 protein [Mus musculus] | 2e-50
1482 | AB000517 | Rattus sp. mRNA for CDP-diacylglycerol synthase, complete cds | e-146 | 1517822 | (U41736) ancient ubiquitous 46 kDa protein AUP46 precursor [Mus musculus] | 2e-51
1483 | D38517 | Mouse mRNA for Dhm1 protein, complete cds | e-118 | 2137562 | mouse Dhm1 protein - mouse musculus] | 6e-54
1484 | X54352 | M.domesticus MD6 mRNA | e-139 | 1085499 | CDC4 repeat unit-containing protein - mouse | 1e-55
1485 | U57692 | Mus musculus N-terminal asparagine amidohydrolase (Ntan1) mRNA, complete cds | e-118 | 2498797 | PROTEIN N-TERMINAL ASPARAGINE AMIDOHYDROLASE (PROTEIN NH2-TERMINAL ASPARAGINE DEAMIDASE) (NTN-AMIDASE) (PNAD) (PROTEIN NH2-TERMINAL ASPARAGINE AMIDOHYDROLASE) (PNAA) >gi|1373365 (U57691) N-terminal asparagine amidohydrolase [Mus musculus] amidohydrolase [Mus musculus] | 5e-57
1486 | X80169 | M.musculus mRNA for 200 kD protein | e-119 | 1717793 | PROTEIN TSG24 (MEIOTIC CHECK POINT REGULATOR) >gi|1083553|pir||A55117 tsg24 | 9e-58
1487 | U57692 | Mus musculus N-terminal asparagine amidohydrolase (Ntan1) mRNA, complete cds | e-120 | 2498797 | PROTEIN N-TERMINAL ASPARAGINE AMIDOHYDROLASE (PROTEIN NH2-TERMINAL ASPARAGINE DEAMIDASE) (NTN-AMIDASE) (PNAD) (PROTEIN NH2-TERMINAL ASPARAGINE AMIDOHYDROLASE) (PNAA) >gi|1373365 (U57691) N-terminal asparagine amidohydrolase [Mus musculus] amidohydrolase [Mus musculus] | 8e-58
1488 | U08215 | Mus musculus Hsp70-related NST-1 (hsrT) mRNA, complete cds. | e-109 | 473407 | (U08215) NST-1 [Mus musculus] | 7e-58
1489 | D85926 | Mouse mRNA for Ray, complete cds | e-110 | 1944389 | (D85926) Ray [Mus musculus] | 2e-58
1490 | L20427 | Rattus norvegicus dihydroxypolyprenylbenzoate methyltransferase mRNA, complete cds | e-123 | 457372 | (L20427) dihydroxypolyprenylbenzoate methyltransferase dihydroxypolyprenylbenzoate methyltransferase [Rattus norvegicus] | 4e-59
1491 | X56044 | M.musculus mRNA for protein Htf9C | e-121 | 3183977 | (X56044) protein Htf9C [Mus musculus] | 1e-60
1492 | S74774 | p59fyn(T)=OKT3-induced calcium influx regulator | e-163 | 729896 | PROTO-ONCOGENE TYROSINE-PROTEIN KINASE FYN (P59-FYN) >gi|420217|pir||A44991 protein-tyrosine kinase (EC 2.7.1.112) fyn - mouse | 8e-63
1493 | U88873 | Mus musculus BUB2-like protein 1 (HBLPI) mRNA, complete cds | e-123 | 4099611 | (U88873) BUB2-like protein 1 [Mus musculus] | 1e-63
1494 | U48852 | Cricetulus griseus HT protein mRNA, complete cds. | e-117 | 1216486 | (U48852) HT protein [Cricetulus griseus] | 7e-64
1495 | AF032667 | Rattus norvegicus rexo70 mRNA, complete cds | e-142 | 2827160 | (AF032667) rexo70 [Rattus norvegicus] | 5e-66
1496 | M62722 | Chinese hamster phosphatidylserine decarboxylase mRNA, 3' end. | e-114 | 118910 | PHOSPHATIDYLSERINE DECARBOXYLASE PROENZYME >gi|109423|pir||A38732 phosphatidylserine decarboxylase (EC 4.1.1.65) - Chinese hamster (fragment) | 2e-67
1497 | AF072758 | Mus musculus fatty acid transport protein 3 mRNA, partial cds | e-130 | 3335567 | (AF072758) fatty acid transport protein 3; FATP3 [Mus musculus] | 1e-67
1498 | AB005549 | Rattus norvegicus mRNA for atypical PKC specific binding protein, complete cds | e-113 | 3868778 | (AB005549) atypical PKC specific binding protein [Rattus norvegicus] | 2e-69
1499 | U57344 | Mus musculus homeobox protein Meis3 mRNA, complete cds | e-143 | 3024124 | HOMEOBOX PROTEIN MEIS3 | 6e-72
1500 | U09874 | Mus musculus SKD3 mRNA, complete cds. | e-142 | 2493735 | SKD3 PROTEIN SKD3 [Mus musculus] | 1e-72
1501 | U72194 | Mus musculus muskelin mRNA, complete cds | e-148 | 3493462 | (U72194) muskelin [Mus musculus] | 2e-74
1502 | X80169 | M.musculus mRNA for 200 kD protein | e-155 | 1717793 | PROTEIN TSG24 (MEIOTIC CHECKPOINT REGULATOR) >gi|1083553|pir||A55117 tsg24 | 3e-77
1503 | U72194 | Mus musculus muskelin mRNA, complete cds | e-154 | 3493462 | (U72194) muskelin [Mus musculus] | [illegible]
1504 | Y12836 | Cricetulus griseus mRNA for Zn finger factor | e-146 | [illegible] | (Y12836) Zn finger factor [Cricetulus griseus] | [illegible]
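
The P VALUE columns of the nearest-neighbor table mix several notations: plain decimals (0.027), scientific notation (2e-55), and exponent-only entries (e-119). The following Python sketch shows one way such entries could be normalized before filtering hits. It assumes that an exponent-only entry stands for a mantissa of 1, and the 1e-10 cutoff is an illustrative choice rather than a threshold stated in this document; the function names are hypothetical.

```python
def parse_p_value(text: str) -> float:
    """Convert a table entry such as '2e-55', 'e-119', or '0.027' to a float."""
    cleaned = text.strip().lower().replace(" ", "")
    if cleaned.startswith("e"):
        # Assumption: exponent-only entries such as 'e-119' mean 1e-119.
        cleaned = "1" + cleaned
    return float(cleaned)


def is_significant(text: str, threshold: float = 1e-10) -> bool:
    """Return True when the parsed P value is at or below the threshold.

    The default threshold is an illustrative assumption.
    """
    return parse_p_value(text) <= threshold


# Entries copied from the rows for SEQ ID NOs 1413, 1411, and 1472.
for entry in ["0.027", "2e-55", "e-119"]:
    print(entry, parse_p_value(entry), is_significant(entry))
```

A stricter or looser cutoff can be passed as the second argument of `is_significant`.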
Table 5

SEQ ID | Start | Stop | Score | Direction | Description
29 | 295 | 421 | 5872 | For | mkk like kinases
30 | 31 | 182 | 3943 | For | Basic region plus leucine zipper transcription factors
31 | 298 | 397 | 5625 | For | mkk like kinases
186 | 175 | 395 | 7660 | For | SH2 Domain
187 | 358 | 432 | 4320 | For | Ank repeat
196 | 37 | 322 | 6049 | For | mkk like kinases
[illegible] | [illegible] | [illegible] | [illegible] | [illegible] | SH3 Domain
[illegible] | [illegible] | [illegible] | [illegible] | [illegible] | Zinc finger, C2H2 type
410 | 42 | 191 | 4036 | For | Basic region plus leucine zipper transcription factors
431 | 71 | 498 | [illegible] | [illegible] | ATPases Associated with Various Cellular Activities
552 | 116 | 288 | 3930 | Rev | Basic region plus leucine zipper transcription factors
639 | 157 | 561 | 5797 | For | ATPases Associated with Various Cellular Activities
746 | 209 | 427 | 5379 | For | Fibronectin type III domain
768 | 116 | 288 | 3930 | For | Basic region plus leucine zipper transcription factors
807 | 339 | 392 | 3620 | For | Zinc finger, C2H2 type
820 | 341 | 406 | 2930 | Rev | EF-hand
822 | 108 | 262 | 4179 | For | Basic region plus leucine zipper transcription factors
836 | 158 | 353 | 4430 | For | Basic region plus leucine zipper transcription factors
1157 | 41 | 444 | 5279 | Rev | protein kinase
1192 | 186 | 416 | 5469 | For | Fibronectin type III domain
1268 | 238 | 315 | 3540 | For | Ank repeat
1269 | 79 | 240 | 11640 | For | LIM domain containing proteins
1288 | 73 | 234 | 3953 | For | Basic region plus leucine zipper transcription factors
1309 | 248 | 404 | 8226 | for | LIM domain containing proteins
1324 | 294 | 356 | 4690 | for | Zinc finger, C2H2 type
1325 | 1 | 234 | 8981 | for | C2 domain (prot. kinase C like)
1336 | 66 | 164 | 6390 | for | WD domain, G-beta repeats
1360 | 222 | 377 | 8686 | for | LIM domain containing proteins
1365 | 69 | 257 | 5221 | for | Basic region plus leucine zipper transcription factors
1380 | 42 | 140 | 7130 | for | WD domain, G-beta repeats
1386 | 243 | 398 | 8736 | for | LIM domain containing proteins
1410 | 222 | 350 | 10553 | for | Trypsin
1417 | 8 | 354 | 6073 | for | Protein Tyrosine Phosphatase
1454 | 49 | 209 | 3996 | for | Basic region plus leucine zipper transcription factors
1464 | 4 | 180 | 4978 | for | RNA recognition motif (aka RRM, RBD, or RNP domain)
1478 | 54 | 437 | 5176 | for | protein kinase
1496 | 241 | 520 | 3929 | for | Helicases conserved C-terminal domain
1496 | 40 | 612 | 5187 | for | protein kinase
1503 | 154 | 216 | 4870 | for | Zinc finger, C2H2 type
1514 | 2 | 252 | 4662 | for | RNA recognition motif (aka RRM, RBD, or RNP domain)
1527 | 156 | 212 | 3520 | for | Zinc finger, C2H2 type
1538 | 9 | 635 | 11087 | for | wnt family of developmental signaling proteins
1540 | 289 | 471 | 4107 | for | Basic region plus leucine zipper transcription factors
1549 | 200 | 391 | 4118 | for | Basic region plus leucine zipper transcription factors
1556 | 163 | 354 | 3958 | for | Basic region plus leucine zipper transcription factors
1557 | 207 | 398 | 4038 | for | Basic region plus leucine zipper transcription factors
1563 | 107 | 298 | 3978 | for | Basic region plus leucine zipper transcription factors



1622 | 180 | 365 | 4022 | for | Basic region plus leucine zipper transcription factors
1630 | 100 | 291 | 3998 | for | Basic region plus leucine zipper transcription factors
1674 | 196 | 258 | 4880 | for | Zinc finger, C2H2 type
1676 | 9 | 86 | 6610 | for | Homeobox Domain
1677 | 316 | 369 | 5780 | rev | Thioredoxins
1688 | 109 | 410 | 17414 | for | Ras family
1704 | 184 | 372 | 3977 | for | Basic region plus leucine zipper transcription factors
1707 | 92 | 439 | 24100 | rev | Phosphatidylinositol-specific phospholipase C, Y domain
1711 | 263 | 361 | 6400 | for | WD domain, G-beta repeats
1744 | 238 | 433 | 10572 | rev | Serine carboxypeptidases
1755 | 281 | 367 | 2580 | for | EF-hand
1762 | 236 | 334 | 5880 | for | WD domain, G-beta repeats
1779 | 64 | 126 | 4790 | for | Zinc finger, C2H2 type
1801 | 295 | 351 | 4030 | for | Zinc finger, C2H2 type
[illegible] | [illegible] | [illegible] | [illegible] | [illegible] | Ank repeat
1808 | [illegible] | [illegible] | 4170 | for | Basic region plus leucine zipper transcription factors
1811 | 184 | 315 | 8390 | for | N-terminal homology in Ets domain
1814 | 127 | 294 | 10770 | for | Bromodomain (conserved sequence found in human, Drosophila and yeast proteins.)
1818 | 9 | 146 | 4741 | for | Double-stranded RNA binding motif
1819 | 278 | 355 | 3460 | for | Ank repeat
1820 | 123 | 299 | 12150 | for | Homeobox Domain
1821 | 127 | 303 | 12180 | for | Homeobox Domain
1830 | 184 | 267 | 4270 | for | Ank repeat
1832 | 18 | 173 | 8987 | for | SH3 Domain
1835 | 51 | 206 | 8987 | for | SH3 Domain
1839 | 224 | 307 | 4270 | for | Ank repeat
1846 | 12 | 398 | 36700 | for | G-protein alpha subunit
1909 | 160 | 258 | 6370 | for | WD domain, G-beta repeats
1911 | 35 | 151 | 9335 | for | Zinc finger, C3HC4 type (RING finger)
1980 | 60 | 197 | 7917 | for | Zinc finger, C3HC4 type (RING finger)
2065 | 253 | 306 | 5410 | for | Zinc finger, CCHC class
2135 | 2 | 401 | 10596 | for | ATPases Associated with Various Cellular Activities
2216 | 90 | 179 | 5380 | for | WW/rsp5/WWP domain containing proteins
2218 | 127 | 225 | 5500 | for | WD domain, G-beta repeats
2281 | 20 | 387 | 6044 | for | Protein Tyrosine Phosphatase
2282 | 183 | 353 | 5136 | for | C2 domain (prot. kinase C like)
2286 | 12 | 382 | 5228 | for | protein kinase
2310 | 20 | 371 | 5962 | for | Protein Tyrosine Phosphatase
2363 | 48 | 211 | 4132 | for | Basic region plus leucine zipper transcription factors
2424 | 43 | 194 | 3996 | for | Basic region plus leucine zipper transcription factors
2428 | 25 | 350 | 4675 | for | Dual specificity phosphatase, catalytic domain
2562 | 18 | 101 | 4560 | for | Ank repeat
2577 | 0 | 311 | 10295 | for | 4 transmembrane segments integral membrane proteins
2591 | 60 | 165 | 4560 | for | SH2 Domain
2684 | 9 | 461 | 5759 | for | ATPases Associated with Various Cellular Activities
2826 | 116 | 400 | 16107 | for | DEAD and DEAH box helicases
2859 | 100 | 320 | 5550 | rev | ATPases Associated with Various Cellular Activities
2871 | 198 | 392 | 9384 | for | DEAD and DEAH box helicases
2944 | 18 | 281 | 10480 | for | Calpain large subunit, domain III
2969 | 5 | 387 | 5976 | rev | protein kinase
3015 | 131 | 214 | 3600 | for | Ank repeat
3047 | 191 | 292 | 5295 | for | WD domain, G-beta repeats
3081 | 190 | 252 | 4360 | for | Zinc finger, C2H2 type
3108 | 275 | 367 | 5791 | for | WD domain, G-beta repeats
3147 | 190 | 369 | 4022 | for | Basic region plus leucine zipper transcription factors
3152 | 129 | 320 | 3947 | for | Basic region plus leucine zipper transcription factors
3158 | 167 | 334 | 4180 | for | Basic region plus leucine zipper transcription factors
3175 | 14 | 164 | 5951 | for | mkk like kinases
3175 | 8 | 112 | 5968 | for | protein kinase
3178 | 45 | 386 | 19398 | for | ATPases Associated with Various Cellular Activities
3183 | 14 | 215 | 9133 | for | 4 transmembrane segments integral membrane proteins
3190 | 229 | 390 | 6089 | for | mkk like kinases
3190 | 118 | 390 | 8063 | for | protein kinase
3193 | 293 | 355 | 3570 | for | Zinc finger, C2H2 type
3195 | 0 | 215 | 10146 | for | 4 transmembrane segments integral membrane proteins
3197 | 281 | 343 | 4490 | for | Zinc finger, C2H2 type
3208 | 34 | 256 | 4190 | for | Basic region plus leucine zipper transcription factors
3258 | 138 | 394 | 9877 | for | Ras family
3266 | 8 | 139 | 9328 | for | ATPases Associated with Various Cellular Activities
3267 | 97 | 180 | 3820 | for | Ank repeat
3274 | 11 | 187 | 15442 | for | Fork head domain, eukaryotic transcription factors
3281 | 15 | 182 | 9681 | for | mkk like kinases
3285 | 16 | 102 | 4680 | for | EF-hand
3292 | 208 | 300 | 5585 | for | WD domain, G-beta repeats
3297 | 7 | 153 | 6100 | for | Helicases conserved C-terminal domain
3306 | 161 | 223 | 4900 | for | Zinc finger, C2H2 type
3307 | 43 | 321 | 8740 | for | SH2 Domain
3339 | 94 | 342 | 14970 | for | SH2 Domain
3345 | 65 | 271 | 12512 | for | PDZ domain
3351 | 124 | 270 | 6068 | for | Phorbol esters/diacylglycerol binding
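
Each Table 5 entry pairs a SEQ ID with a profile hit described by start, stop, score, direction, and a domain description, and some SEQ IDs (for example 1496) carry more than one hit. The Python sketch below shows one way such entries could be held as records and grouped by SEQ ID. The class name, the `span` helper, and the assumption that Start and Stop are inclusive positions are illustrative and are not taken from this document.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class ProfileHit:
    seq_id: int
    start: int
    stop: int
    score: int
    direction: str  # "for" or "rev", as printed in Table 5
    description: str

    @property
    def span(self) -> int:
        # Assumption: Start and Stop are inclusive positions.
        return self.stop - self.start + 1


# A few rows copied from Table 5.
hits = [
    ProfileHit(29, 295, 421, 5872, "for", "mkk like kinases"),
    ProfileHit(1496, 241, 520, 3929, "for", "Helicases conserved C-terminal domain"),
    ProfileHit(1496, 40, 612, 5187, "for", "protein kinase"),
    ProfileHit(1677, 316, 369, 5780, "rev", "Thioredoxins"),
]

# Group hits by SEQ ID so sequences with several profile hits stay together.
hits_by_seq = defaultdict(list)
for hit in hits:
    hits_by_seq[hit.seq_id].append(hit)

# Pick the highest-scoring hit for each SEQ ID.
best_hit = {seq: max(group, key=lambda h: h.score) for seq, group in hits_by_seq.items()}
for seq, hit in sorted(best_hit.items()):
    print(seq, hit.description, hit.score, hit.span)
```

Keeping every hit per SEQ ID, rather than only the best one, preserves cases where one sequence matches two different profiles.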
Example 4 

Differential Expression of Polynucleotides of the Invention: 
Description of Libraries and Detection of Differential Expression 

The relative expression levels of the polynucleotides of the invention 
were assessed in several libraries prepared from various sources, including cell lines and 
patient tissue samples. Table 6 provides a summary of these libraries, including the 
shortened library name (used hereafter), the mRNA source used to prepare the cDNA 
library, the abbreviated name of the library that is used in the tables below (in quotes), 
and the approximate number of clones in the library. 



Table 6 

Description of cDNA Libraries 

Library (lib #) | Description | Number of Clones in this Clustering
1 | Km12L4; Human Colon Cell Line, High Metastatic Potential (derived from Km12C); "High Colon" | 307133
2 | Km12C; Human Colon Cell Line, Low Metastatic Potential; "Low Colon" | 284755
3 | MDA-MB-231; Human Breast Cancer Cell Line, High Metastatic Potential; micro-metastases in lung; "High Breast" | 326937
4 | MCF7; Human Breast Cancer Cell, Non Metastatic; "Low Breast" | 318979
8 | MV-522; Human Lung Cancer Cell Line, High Metastatic Potential; "High Lung" | 223620
9 | UCP-3; Human Lung Cancer Cell Line, Low Metastatic Potential; "Low Lung" | 312503
12 | Human microvascular endothelial cells (HMEC) - Untreated; PCR (OligodT) cDNA library | 41938
13 | Human microvascular endothelial cells (HMEC) - Basic fibroblast growth factor (bFGF) treated; PCR (OligodT) cDNA library | 42100
14 | Human microvascular endothelial cells (HMEC) - Vascular endothelial growth factor (VEGF) treated; PCR (OligodT) cDNA library | 42825
15 | Normal Colon - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 34285
16 | Colon Tumor - UC#2 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 35625
17 | Liver Metastasis from Colon Tumor of UC#2 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 36984
18 | Normal Colon - UC#3 Patient; PCR (OligodT) cDNA library; "Normal Colon Tumor Tissue" | 36216
19 | Colon Tumor - UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Tumor Tissue" | 41388
20 | Liver Metastasis from Colon Tumor of UC#3 Patient; PCR (OligodT) cDNA library; "High Colon Metastasis Tissue" | 30956
21 | GRRpz; Human Prostate Cell Line | 164801
22 | WOca; Human Prostate Cancer Cell Line | 162088



The KM12L4 and KM12C cell lines are described in Example 1 above.
The MDA-MB-231 cell line was originally isolated from pleural effusions (Cailleau, J.
Natl. Cancer Inst. (1974) 53:661), is of high metastatic potential, and forms poorly
differentiated adenocarcinoma grade II in nude mice consistent with breast carcinoma.
The MCF7 cell line was derived from a pleural effusion of a breast adenocarcinoma and
is non-metastatic. The MV-522 cell line is derived from a human lung carcinoma and is
of high metastatic potential. The UCP-3 cell line is a low metastatic human lung
carcinoma cell line; the MV-522 is a high metastatic variant of UCP-3. These cell lines
are well-recognized in the art as models for the study of human breast and lung cancer
(see, e.g., Chandrasekaran et al., Cancer Res. (1979) 39:870 (MDA-MB-231 and MCF-7);
Gastpar et al., J Med Chem (1998) 41:4965 (MDA-MB-231 and MCF-7); Ranson et
al., Br J Cancer (1998) 77:1586 (MDA-MB-231 and MCF-7); Kuang et al., Nucleic
Acids Res (1998) 26:1116 (MDA-MB-231 and MCF-7); Varki et al., Int J Cancer
(1987) 40:46 (UCP-3); Varki et al., Tumour Biol (1990) 11:327 (MV-522 and UCP-3);
Varki et al., Anticancer Res. (1990) 10:637 (MV-522); Kelner et al., Anticancer Res
(1995) 15:867 (MV-522); and Zhang et al., Anticancer Drugs (1997) 8:696 (MV-522)).
The samples of libraries 15-20 are derived from two different patients (UC#2 and
UC#3). The bFGF-treated HMEC were prepared by incubation with bFGF at 10 ng/ml
for 2 hrs; the VEGF-treated HMEC were prepared by incubation with 20 ng/ml VEGF
for 2 hrs. Following incubation with the respective growth factor, the cells were
washed and lysis buffer added for RNA preparation. The GRRpz cell line refers to low
passage (3 passages or fewer) human prostate cells, and the WOca cell line refers to low
passage (3 passages or fewer) human prostate cancer cells.
Each of the libraries is composed of a collection of cDNA clones that in
turn are representative of the mRNAs expressed in the indicated mRNA source. In
order to facilitate the analysis of the millions of sequences in each library, the sequences
were assigned to clusters. The concept of "cluster of clones" is derived from a
sorting/grouping of cDNA clones based on their hybridization pattern to a panel of
roughly 300 7-bp oligonucleotide probes (see Drmanac et al., Genomics (1996)
37(1):29). Random cDNA clones from a tissue library are hybridized at moderate
stringency to 300 7-bp oligonucleotides. Each oligonucleotide has some measure of
specific hybridization to that specific clone. The combination of 300 of these measures
of hybridization for 300 probes equals the "hybridization signature" for a specific clone.
Clones with similar sequence will have similar hybridization signatures. By developing
a sorting/grouping algorithm to analyze these signatures, groups of clones in a library
can be identified and brought together computationally. These groups of clones are
termed "clusters". Depending on the stringency of the selection in the algorithm
(similar to the stringency of hybridization in a classic library cDNA screening protocol),
the "purity" of each cluster can be controlled. For example, artifacts of clustering may
occur in computational clustering just as artifacts can occur in "wet-lab" screening of a
cDNA library with 400 bp cDNA fragments, at even the highest stringency. The
stringency used in the implementation of clustering herein provides groups of clones that
are in general from the same cDNA or closely related cDNAs. Closely related clones
can be a result of different length clones of the same cDNA, closely related clones from
highly related gene families, or splice variants of the same cDNA.
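
The grouping of clones by hybridization signature described above can be illustrated with a minimal sketch. The specification does not disclose the sorting/grouping algorithm itself, so everything below is an assumption made for illustration: the function names, the use of Pearson correlation as the similarity measure, the greedy single-pass assignment, and the `stringency` threshold are all hypothetical and are not the method of Drmanac et al.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length hybridization signatures."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat signature carries no pattern to compare
    return cov / sqrt(var_a * var_b)


def cluster_clones(signatures, stringency=0.9):
    """Greedy grouping (illustrative assumption, not the patented algorithm).

    signatures: dict mapping clone id -> list of per-probe hybridization
                measures (one value per oligonucleotide probe).
    stringency: minimum correlation with a cluster's first member required
                to join that cluster; higher values give "purer" clusters.
    """
    clusters = []  # list of (representative signature, member clone ids)
    for clone_id, signature in signatures.items():
        for representative, members in clusters:
            if pearson(representative, signature) >= stringency:
                members.append(clone_id)
                break
        else:
            # no existing cluster is similar enough: start a new one
            clusters.append((signature, [clone_id]))
    return [members for _, members in clusters]
```

Raising `stringency` in this sketch splits loosely related clones into separate clusters, which mirrors the text's point that cluster "purity" depends on the stringency of selection.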

Differential expression for a selected cluster was assessed by first
determining the number of cDNA clones corresponding to the selected cluster in the
first library (Clones in 1st), and then determining the number of cDNA clones
corresponding to the selected cluster in the second library (Clones in 2nd). Differential
expression of the selected cluster in the first library relative to the second library is
expressed as a "ratio" of percent expression between the two libraries. In general, the
"ratio" is calculated by: 1) calculating the percent expression of the selected cluster in
the first library by dividing the number of clones corresponding to a selected cluster in
the first library by the total number of clones analyzed from the first library;
2) calculating the percent expression of the selected cluster in the second library by
dividing the number of clones corresponding to a selected cluster in a second library by
the total number of clones analyzed from the second library; and 3) dividing the calculated
percent expression from the first library by the calculated percent expression from the
second library. If the "number of clones" corresponding to a selected cluster in a library
is zero, the value is set at 1 to aid in calculation. The formula used in calculating the
ratio takes into account the "depth" of each of the libraries being compared, i.e., the
total number of clones analyzed in each library.
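
The three-step ratio calculation described above can be sketched as follows. The function name and argument names are hypothetical; the arithmetic follows the steps in the text, including the substitution of 1 for a clone count of zero.

```python
def expression_ratio(clones_1st, total_1st, clones_2nd, total_2nd):
    """Ratio of percent expression in the first library to the second.

    clones_*: number of clones in the selected cluster in each library
    total_*:  total number of clones analyzed in each library ("depth")
    """
    # A count of zero is set to 1 to aid in calculation, as stated in the text.
    c1 = clones_1st if clones_1st > 0 else 1
    c2 = clones_2nd if clones_2nd > 0 else 1
    percent_1st = 100.0 * c1 / total_1st  # step 1
    percent_2nd = 100.0 * c2 / total_2nd  # step 2
    return percent_1st / percent_2nd      # step 3
```

As a check against the tables below, taking the library sizes from Table 6 (lib 3: 326937 clones; lib 4: 318979 clones), a cluster with 64 clones in lib 3 and 0 clones in lib 4 gives `expression_ratio(64, 326937, 0, 318979)`, about 62.4, which rounds to the value of 62 listed for SEQ ID NO:472 in Table 7.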

In general, a polynucleotide is said to be significantly differentially
expressed between two samples when the ratio value is greater than at least about 2,
preferably greater than at least about 3, more preferably greater than at least about 5,
where the ratio value is calculated using the method described above. The significance
of differential expression is determined using a z score test (Zar, Biostatistical Analysis,
Prentice Hall, Inc., USA, "Differences between Proportions," pp 296-298 (1974)).
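
For orientation, a z score for the difference between two proportions can be sketched as below. This is the standard pooled two-proportion z test written under stated assumptions: the function name is hypothetical, no continuity correction is applied, and the exact formulation in the cited Zar reference (and the one actually used for the tables) may differ.

```python
from math import erfc, sqrt


def two_proportion_z(clones_1st, total_1st, clones_2nd, total_2nd):
    """Return (z, two-sided p-value) for the difference in cluster frequency.

    Uses the pooled-variance form of the test; observed counts are used
    as-is (the zero-to-one substitution applies only to the ratio).
    """
    p1 = clones_1st / total_1st
    p2 = clones_2nd / total_2nd
    pooled = (clones_1st + clones_2nd) / (total_1st + total_2nd)
    se = sqrt(pooled * (1.0 - pooled) * (1.0 / total_1st + 1.0 / total_2nd))
    if se == 0.0:
        return 0.0, 1.0  # no clones in either library: nothing to test
    z = (p1 - p2) / se
    p_value = erfc(abs(z) / sqrt(2.0))  # two-sided normal tail probability
    return z, p_value
```

In this sketch a positive z indicates higher expression in the first library; a cluster would be flagged only when both the ratio threshold and the chosen significance level are met.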
EXAMPLE 5

Polynucleotides Differentially Expressed in High Metastatic Potential 
Breast Cancer Cells Versus Low Metastatic Breast Cancer Cells 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential breast
cancer tissue and low metastatic breast cancer cells. Expression of these sequences in
breast cancer can be valuable in determining diagnostic, prognostic and/or treatment
information. For example, sequences that are highly expressed in the high metastatic
potential cells can be indicative of increased expression of genes or regulatory
sequences involved in the metastatic process. A patient sample displaying an increased
level of one or more of these polynucleotides may thus warrant more aggressive
treatment. In another example, sequences that display higher expression in the low
metastatic potential cells can be associated with genes or regulatory sequences that
inhibit metastasis, and thus the expression of these polynucleotides in a sample may
warrant a more positive prognosis than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the
like. These polynucleotide sequences can also be used in combination with other
known molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential breast cancer cells and low metastatic
potential breast cancer cells.

Table 7

Differentially expressed polynucleotides: Higher expression in
high metastatic potential breast cancer (lib3) relative to low metastatic
breast cancer cells (lib4)

SEQ ID NOs: | Lib3 clones | Lib4 clones | lib3/lib4
472 | 64 | 0 | 62
1851 | 6 | 0 | 6
1856 | 8 | 0 | 8
1867 | 6 | 0 | 6
1872 | 6 | 0 | 6
1875 | 12 | 3 | 4
1923 | 89 | 22 | 4
2118 | 7 | 0 | 7
2119 | 7 | 0 | 7
2135 | 37 | 13 | 3
2190 | 19 | 0 | 19
2193 | 16 | 5 | 3
2232 | 12 | 2 | 6
2239 | 6 | 0 | 6
2338 | 21 | 2 | 10
2378 | 16 | 4 | 4
2394 | 6 | 0 | 6
2395 | 6 | 0 | 6
2490 | 13 | 3 | 4
2505 | 16 | 2 | 8
2540 | 8 | 1 | 8
2542 | 11 | 1 | 11
2607 | 11 | 2 | 5
2640 | 22 | 5 | 4
2674 | 8 | 0 | 8
2679 | 19 | 0 | 19
2684 | 14 | 4 | 3
2707 | 8 | 0 | 8
2724 | 9 | 0 | 9
2757 | 6 | 0 | 6
2776 | 10 | 0 | 10
2804 | 13 | 2 | 6
2818 | 6 | 0 | 6
2906 | 14 | 0 | 14
2959 | 26 | 8 | 3
2964 | 17 | 4 | 4
2968 | 6 | 0 | 6
2977 | 22 | 3 | 7
2980 | 13 | 1 | 13
3010 | 6 | 0 | 6
3043 | 10 | 1 | 10
3071 | 33 | 12 | 3
3072 | 9 | 1 | 9
3095 | 19 | 3 | 6
3097 | 11 | 2 | 5
3173 | 12 | 2 | 6
3203 | 8 | 1 | 8
3210 | 27 | 8 | 3
3212 | 13 | 1 | 13
3284 | 8 | 0 | 8
3288 | 6 | 0 | 6
3331 | 14 | 3 | 5
3335 | 13 | 1 | 13



Table 8

Differentially expressed polynucleotides: Higher expression in
low metastatic breast cancer cells (lib4) relative to high metastatic
potential breast cancer (lib3)

SEQ ID NOs: | Lib 3 Clones | Lib 4 Clones | lib4/lib3
402 | 0 | 6 | 6
614 | 3 | 21 | 7
624 | 0 | 6 | 6
626 | 0 | 8 | 8
712 | 0 | 9 | 9
744 | 0 | 7 | 7
1325 | 2 | 29 | 15
1452 | 2 | 13 | 7
1880 | 0 | 9 | 9
1915 | 0 | 7 | 7
1951 | 0 | 6 | 6
1955 | 8 | 32 | 4
2015 | 0 | 7 | 7
2046 | 0 | 7 | 7
2076 | 1 | 22 | 23
2087 | 0 | 6 | 6
2124 | 0 | 9 | 9
2145 | 0 | 8 | 8
2162 | 0 | 6 | 6
2163 | 0 | 12 | 12
2164 | 5 | 19 | 4
2172 | 2 | 15 | 8
2192 | 5 | 16 | 3
2244 | 20 | 43 | 2
2266 | 3 | 18 | 6
2313 | 24 | 56 | 2
2346 | 1 | 13 | 13
2355 | 0 | 10 | 10
2371 | 0 | 6 | 6
2393 | 1 | 17 | 17
2404 | 1 | 21 | 22
2443 | 0 | 6 | 6
2460 | 0 | 11 | 11
2523 | 0 | 6 | 6
2575 | 1 | 10 | 10
2578 | 0 | 6 | 6
2584 | 1 | 17 | 17
2590 | 1 0 | 6 | 6
2609 | 1 | 9 | 9
2632 | 5 | 24 | 5
2714 | 5 | 24 | 5
2728 | 0 | 6 | 6
2752 | 1 | 14 | 14
2794 | 4 | 15 | 4
2826 | 0 | 7 | 7
2987 | 5 | 15 | 3
3005 | 1 | 14 | 14
3009 | 20 | 58 | 3
3047 | 4 | 17 | 4
3057 | 2 | 17 | 9
3075 | 2 | 11 | 6
3076 | 0 | 6 | 6
3102 | 0 | 6 | 6
3128 | 15 | 52 | 4
3132 | 15 | 52 | 4
3142 | 0 | 6 | 6
3187 | 22 | 49 | 2
3253 | 23 | 96 | 4
3282 | 19 | 46 | 2
3285 | 20 | 40 | 2
3346 | 0 | 9 | 9

EXAMPLE 6

Polynucleotides Differentially Expressed in High Metastatic Potential Lung 
Cancer Cells Versus Low Metastatic Lung Cancer Cells 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential lung
cancer cells and low metastatic lung cancer cells. Expression of these sequences in lung
cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment
information. For example, sequences that are highly expressed in the high metastatic
potential cells can be indicative of increased expression of genes or regulatory
sequences involved in the metastatic process. A patient sample displaying an increased
level of one or more of these polynucleotides may thus warrant more aggressive
treatment. In another example, sequences that display higher expression in the low
metastatic potential cells can be associated with genes or regulatory sequences that
inhibit metastasis, and thus the expression of these polynucleotides in a sample may
warrant a more positive prognosis than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the
like. These polynucleotide sequences can also be used in combination with other
known molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential lung cancer cells and low metastatic
potential lung cancer cells:

Table 9

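Differentially expressed polynucleotides: Higher expression in high
metastatic potential lung cancer cells (lib8) relative to low
metastatic lung cancer cells (lib9)

SEQ ID NO: | Lib8 clones | Lib9 clones | lib8/lib9
[illegible] | [illegible] | [illegible] | 10
[illegible] | [illegible] | [illegible] | 5
[illegible] | [illegible] | [illegible] | 7
152 | [illegible] | [illegible] | [illegible]
[illegible] | [illegible] | [illegible] | [illegible]
200 | [illegible] | [illegible] | 14
254 | [illegible] | [illegible] | 7
262 | [illegible] | [illegible] | 7
271 | [illegible] | [illegible] | 7
348 | 6 | [illegible] | [illegible]
412 | [illegible] | [illegible] | 7
507 | [illegible] | [illegible] | 7
520 | 6 | [illegible] | [illegible]
530 | 5 | [illegible] | 7
588 | [illegible] | [illegible] | 7
623 | [illegible] | [illegible] | 10
637 | [illegible] | [illegible] | [illegible]
660 | [illegible] | [illegible] | 7
[illegible] | [illegible] | [illegible] | 11
[illegible] | [illegible] | [illegible] | 7
[illegible] | [illegible] | [illegible] | 6
[illegible] | [illegible] | [illegible] | 3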


774 | 11 | 0 | 15
812 | 5 | 0 | 7
834 | 8 | 2 | 6
901 | 11 | 2 | 8
1168 | 5 | 0 | 7
1333 | 6 | 0 | 8
1352 | 5 | 0 | 7
1524 | 11 | 1 | 15
1706 | 5 | 0 | 7
1752 | 17 | 9 | 3
1768 | 20 | 4 | 7
1769 | 5 | 0 | 7
1780 | 6 | 0 | 8

SEQ ID NO:
1781 
1799 
1803 
1811 
1884 
1919 
1939 
1975 
2024 
2045 
2060 
2071 
2128 
2177 
2181 
2184 
2185 
2283 
2311 
2314 
2393 
2398 
2460 
2514 
2597 
2657 
2669 
2670 
3047 
3050 
3092 
3140 
3157 
3187 
3210 
3220 
3236 
3249 
3264 
3305 
3309 



Lib8 clones 
40 



16 



8 



43_ 
_I2 
_8 
20 
16 



\0_ 
44 

TT 

10 



11 

]0_ 
14 



10 



8 



2J_ 
16 



181 



16 



28 



I 6 - 
8 



20 



Lib9 clones 



0 



0 



13 



0 



13 



0 



0 



0^ 
0 



119 

0 



0 



0 



0_ 
0 



Iib8/lib9 
19 
8 
8 



_8 
8 



11 
11 



15 



10 



14 



8 



8 



_8 
10 



10 



_T0 

i°_ 
22 



\0_ 
28

SEQ ID NO: | Lib8 clones | Lib9 clones | lib8/lib9
3318 | 24 | 4 | 8
3330 | 5 | 0 | 7
3331 | 5 | 0 | 7



Table 10 

Differentially expressed polynucleotides: Higher expression in low metastatic lung 
cancer cells (lib 9) relative to high metastatic potential lung cancer cells (lib 8) 



SEQ ID NO: | Lib 8 clones | Lib 9 clones | lib 9/lib 8
24 | 3 | 20 | 5
53 | 0 | 18 | 13
64 | 0 | 8 | 6
70 | 0 | 11 | 8
105 | 10 | 66 | 5
129 | 0 | 16 | 11
214 | 1 | 14 | 10
233 | 4 | 35 | 6
237 | 0 | 13 | 9
264 | 0 | 29 | 21
329 | 2 | 17 | 6
368 | 1 | 37 | 26
370 | 0 | 11 | 8
418 | 0 | 8 | 6
450 | 0 | 9 | 6
461 | 0 | 9 | 6
484 | 0 | 26 | 19
494 | 0 | 41 | 29
517 | 1 | 12 | 9
522 | 1 | 11 | 8
581 | 1 | 17 | 12
614 | 3 | 23 | 5
706 | 0 | 11 | 8
726 | 5 | 23 | 3
806 | 0 | 14 | 10
824 | 0 | 9 | 6
836 | 1 | 14 | 10
874 | 0 | 12 | 9
900 | 5 | 21 | 3
1017 | 2 | 14 | 5
1144 | 0 | 8 | 6
1154 | 0 | 12 | 9
1166 | 2 | 45 | 16
1170 | 1 | 13 | 9
1302 | 2 | 13 | 5
1326 | 1 | 13 | 9
1327 | 1 | 13 | 9
1367 | 0 | 12 | 9
1377 | 0 | 12 | 9
1437 | 2 | 18 | 6
1442 | 1 | 14 | 10
1466 | 0 | 13 | 9
1476 | 0 | 13 | 9
1495 | 0 | 8 | 6
1496 | 1 | 13 | 9
1664 | 38 | 253 | 5
1682 | 1 | 17 | 12
1687 | 0 | 9 | 6
1758 | 0 | 8 | 6
1817 | 4 | 18 | 3
1837 | 3 | 16 | 4
1845 | 3 | 23 | 5
1856 | 2 | 17 | 6
1910 | 1 | 18 | 13
2146 | 2 | 16 | 9
2156 | 0 | 9 | 6
2463 | 0 | 12 | 9
2724 | 10 | 38 | 3
2749 | 403 | 2000 | 4
2801 | 6 | 25 | 3
2993 | 3 | 18 | 4
3080 | 0 | 10 | 7
3107 | 3 | 23 | 5
3292 | 0 | 20 | 14
3324 | 110 | 548 | 4

EXAMPLE 7

Polynucleotides Differentially Expressed in High Metastatic Potential 
Colon Cancer Cells Versus Low Metastatic Colon Cancer Cells 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential colon
cancer cells and low metastatic colon cancer cells. Expression of these sequences in
colon cancer tissue can provide diagnostic, prognostic and/or treatment information.
For example, sequences that are highly expressed in the high metastatic potential cells
can be indicative of increased expression of genes or regulatory sequences involved in
the metastatic process. A patient sample displaying an increased level of one or more of
these polynucleotides may thus warrant more aggressive treatment. In another example,
sequences that display higher expression in the low metastatic potential cells can be
associated with genes or regulatory sequences that inhibit metastasis, and thus the
expression of these polynucleotides in a sample may warrant a more positive prognosis
than the gross pathology would suggest.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the
like. These polynucleotide sequences can also be used in combination with other
known molecular and/or biochemical markers.

The following table summarizes identified polynucleotides with
differential expression between high metastatic potential colon cancer cells and low
metastatic potential colon cancer cells:

Table 11

Differentially expressed polynucleotides: Higher expression in low metastatic colon
cancer cells (lib 2) relative to high metastatic potential colon cancer cells (lib 1)

SEQ ID NOs: | Lib 1 clones | Lib 2 clones | lib 2/lib 1
429 | 0 | 9 | 10
1494 | 0 | 8 | 9
1923 | 34 | 114 | 4
1986 | 3 | 12 | 4
2018 | 0 | 9 | 10
2036 | 2 | 10 | 5
2049 | 8 | 25 | 3
2135 | 24 | 87 | 4
2146 | 2 | 16 | 9
2208 | 6 | 27 | 5
2215 | 2 | 11 | 6
2239 | 1 | 10 | 11
2307 | 2 | 12 | 6
2313 | 28 | 62 | 2
2357 | 5 | 14 | 3
2360 | 3 | 21 | 8
2362 | 0 | 6 | 6
2378 | 3 | 12 | 4
2569 | 3 | 20 | 7
2571 | 0 | 6 | 6
2588 | 54 | 172 | 3
2592 | 15 | 41 | 3
2611 | 0 | 6 | 6
2636 | 0 | 9 | 10
2641 | 7 | 20 | 3
2650 | 0 | 9 | 10
2662 | 0 | 9 | 10
2674 | 4 | 13 | 4
2682 | 0 | 6 | 6
2702 | 9 | 25 | 3
2704 | 8 | 23 | 3
2715 | 2 | 12 | 6
2804 | 9 | 22 | 3
2821 | 13 | 29 | 2
2840 | 1 | 8 | 9
2846 | 2 | 15 | 8
2866 | 0 | 6 | 6
2906 | 0 | 6 | 6
2915 | 44 | 109 | 3
2933 | 0 | 6 | 6
2935 | 5 | 16 | 3
2957 | 1 | 11 | 12
2959 | 3 | 27 | 10
2977 | 16 | 30 | 2
2980 | 12 | 27 | 2
3000 | 2 | 13 | 7
3009 | 12 | 29 | 3
3115 | 0 | 7 | 8
3156 | 502 | 2170 | 5
3210 | 2 | 21 | 11
3211 | 0 | 9 | 10
3213 | 0 | 7 | 8
3235 | 2 | 12 | 6
3251 | 2 | 12 | 6
3296 | 3 | 12 | 4
3335 | 1 | 8 | 9





EXAMPLE 8 

Polynucleotides Differentially Expressed in High Metastatic Potential 
Colon Cancer Patient Tissue Versus Normal Patient Tissue 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high metastatic potential colon
cancer tissue and normal tissue. Expression of these sequences in colon cancer tissue
can provide diagnostic, prognostic and/or treatment information. For example,
sequences that are highly expressed in the high metastatic potential cells can be
indicative of increased expression of genes or regulatory sequences involved in the
advanced disease state which involves processes such as angiogenesis, dedifferentiation,
cell replication, and metastasis. A patient sample displaying an increased level of one
or more of these polynucleotides may thus warrant more aggressive treatment.

The differential expression of these polynucleotides can be used as a
diagnostic marker, a prognostic marker, for risk assessment, patient treatment and the
like. These polynucleotide sequences can also be used in combination with other
known molecular and/or biochemical markers.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential colon cancer tissue and normal colon
tissue:

Table 12

Differentially expressed polynucleotides isolated from samples from two patients
(patient 2 and patient 3): Lower expression in high metastatic potential colon tissue
(patient 2: lib 17; patient 3: lib 20) vs. normal colon tissue (patient 2: lib 15; patient
3: lib 18)

SEQ ID NO: | lib 15 clones | lib 17 clones | lib 15/lib 17
69 | 19 | 7 | 3
123 | 6 | 0 | 6
140 | 24 | 8 | 3
197 | 6 | 0 | 6
198 | 113 | 0 | 121
254 | 28 | 9 | 3
412 | 28 | 9 | 3
512 | 11 | 1 | 12
641 | 17 | 7 | 3
642 | 7 | 0 | 8
954 | 12 | 3 | 4
1011 | 209 | 16 | 14
1024 | 8 | 0 | 9
1040 | 12 | 3 | 4
1055 | 26 | 7 | 4
1106 | 31 | 15 | 2
1125 | 17 | 0 | 18
1129 | 17 | 0 | 18
1138 | 109 | 0 | 117
1244 | 14 | 1 | 15
1253 | 73 | 0 | 78
1283 | 34 | 7 | 5
1285 | 34 | 7 | 5
1339 | 13 | 4 | 3
1474 | 73 | 0 | 78
1505 | 18 | 3 | 6
1553 | 68 | 6 | 12
1554 | 2542 | 14 | 195
1605 | 2542 | 14 | 195
1628 | 6 | 0 | 6
1643 | 142 | 4 | 38
1753 | 12 | 0 | 10
1764 | 13 | 0 | 14

SEQ ID NO: | Lib 18 Clones | Lib 20 Clones | lib 18/lib 20
105 | 28 | 11 | 2
198 | 21 | 0 | 18
254 | 9 | 0 | 8
412 | 9 | 0 | 8
1011 | 11 | 1 | 9
1138 | 14 | 0 | 12
1253 | 23 | 0 | 20
1643 | 18 | 0 | 15
1764 | 12 | 0 | 10
3156 | 140 | 43 | 3



Table 13

Differentially expressed polynucleotides isolated from samples from two patients
(patient 2 and patient 3): Lower expression in normal colon tissue (patient 2: lib 15;
patient 3: lib 18) vs. high metastatic potential colon tissue (patient 2: lib 17; patient 3: lib
20).

SEQ ID NO: | Lib 15 Clones | Lib 17 Clones | lib 17/lib 15
321 | 3 | 23 | 7
363 | 1 | 9 | 8
836 | 21 | 99 | 4
859 | 6 | 20 | 3
885 | 13 | 28 | 2
916 | 13 | 28 | 2
981 | 2 | 11 | 5
1226 | 8 | 70 | 8
1308 | 0 | 8 | 7
1317 | 29 | 84 | 3
1429 | 27 | 127 | 4
1442 | 0 | 9 | 8
1534 | 1 | 12 | 11
1540 | 12 | 43 | 3
1552 | 0 | 7 | 7
1556 | 1 | 9 | 8
1557 | 1 | 9 | 8
1569 | 2189 | 5122 | 2
1571 | 6 | 18 | 3
1576 | 3 | 25 | 8
1581 | 4 | 22 | 5
1601 | 25 | 157 | 6
1613 | 9 | 48 | 5
1616 | 15 | 61 | 4
1620 | 2 | 17 | 8
1622 | 4 | 99 | 23
1626 | 6 | 35 | 5
1647 | 4 | 22 | 5
1664 | 4 | 28 | 7
1683 | 2 | 18 | 8
1704 | 3 | 15 | 5
1800 | 0 | 7 | 7
2749 | 23 | 60 | 2
2784 | 4 | 14 | 3
2805 | 1 | 9 | 8
2976 | 3 | 14 | 4
3128 | 18 | 57 | 3
3129 | 26 | 124 | 4
3146 | 64 | 210 | 3
3150 | 940 | 2267 | 2
3151 | 2 | 15 | 7

SEQ ID NO: | lib 18 clones | lib 20 clones | lib 20/lib 18
865 | 0 | 5 | 6
1569 | 1 | 7 | 8
1580 | 1 | 7 | 8
1590 | 1 | 7 | 8
2790 | 0 | 5 | 6



EXAMPLE 9 

Polynucleotides Differentially Expressed in High Colon Tumor Potential 
Patient Tissue Versus Metastasized Colon Cancer Patient Tissue 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from colon cancer tissue and cells derived
from colon cancer tissue metastases to liver. Expression of these sequences in colon
cancer tissue can provide diagnostic, prognostic and/or treatment information associated
with the transformation of precancerous tissue to malignant tissue. This information
can be useful in the prevention of achieving the advanced malignant state in these
tissues, and can be important in risk assessment for a patient.

The following table summarizes identified polynucleotides with
differential expression between high tumor potential colon cancer tissue and cells
derived from high metastatic potential colon cancer cells:



Table 14

Differentially expressed polynucleotides:
Greater expression in metastatic colon tumor tissue (lib 20) vs.
colon tumor tissue (lib 19)

SEQ ID NO: | lib 19 clones | lib 20 clones | lib 20/lib 19
937 | 0 | 6 | 8
976 | 0 | 5 | 7
1520 | 1 | 8 | 11
1546 | 1 | 11 | 15
1550 | 1 | 11 | 15
1574 | 1 | 8 | 11
1580 | 0 | 7 | 9
1590 | 0 | 7 | 9
1599 | 8 | 21 | 4
1607 | 158 | 632 | 5
1622 | 1 | 7 | 9



Table 15

Greater expression in colon tumor tissue (lib 19) than metastatic colon tissue (lib 20)

SEQ ID NO: | lib 19 clones | lib 20 clones | lib 19/lib 20
105 | 64 | 11 | 4
1011 | 53 | 1 | 40
1226 | 18 | 4 | 3
1571 | 8 | 0 | 6
1726 | 15 | 3 | 4
1811 | 17 | 2 | 6
2749 | 47 | 6 | 6
3146 | 19 | 2 | 7
3324 | 20 | 1 | 15

EXAMPLE 10

Polynucleotides Differentially Expressed in High Tumor Potential 
Colon Cancer Patient Tissue Versus Normal Patient Tissue 

A number of polynucleotide sequences have been identified that are
differentially expressed between cells derived from high tumor potential colon cancer
tissue and normal tissue. Expression of these sequences in colon cancer tissue can
provide diagnostic, prognostic and/or treatment information associated with the
prevention of the malignant state in these tissues, and can be important in risk
assessment for a patient. For example, sequences that are highly expressed in the
potential colon cancer cells are associated with or can be indicative of increased
expression of genes or regulatory sequences involved in early tumor progression. A
patient sample displaying an increased level of one or more of these polynucleotides
may thus warrant closer attention or more frequent screening procedures to catch the
malignant state as early as possible.

The following tables summarize polynucleotides that are differentially
expressed between high metastatic potential colon cancer cells and normal colon cells:

Table 16

Differentially expressed polynucleotides detected in samples from patient (patient 2):
Higher expression in normal colon tissue (patient 2: lib 15)
vs. tumor potential colon tissue (patient 2: lib 16)

SEQ ID NO: | lib 15 clones | lib 16 clones | lib 15/lib 16
69 | 19 | 7 | 3
105 | 116 | 54 | 2
140 | 24 | 4 | 6
197 | 6 | 0 | 6
198 | 113 | 3 | 40
254 | 28 | 6 | 5
412 | 28 | 6 | 5
642 | 7 | 0 | 7
830 | 10 | 2 | 5
938 | 31 | 13 | 3
1011 | 209 | 37 | 6
1095 | 12 | 3 | 4
1125 | 17 | 0 | 18
1129 | 17 | 0 | 18
1138 | 109 | 1 | 115
1253 | 73 | 1 | 77
1283 | 34 | 13 | 3
1285 | 34 | 13 | 3
1339 | 13 | 3 | 5
1453 | 11 | 3 | 4
1474 | 73 | 1 | 77
1505 | 18 | 6 | 3
1554 | 2542 | 448 | 6
1605 | 2542 | 448 | 6
1614 | 36 | 14 | 3
1630 | 24 | 9 | 3
1643 | 142 | 2 | 75
1646 | 39 | 14 | 3
1649 | 24 | 8 | 3
1677 | 19 | 6 | 3
1753 | 13 | 0 | 14
1764 | 13 | 0 | 14
1766 | 177 | 65 | 3
1772 | 24 | 8 | 3



Table 17

Differentially expressed polypeptides detected in samples from patient. Lower
expression in normal colon tissue (lib 18) than colon tumor tissue (lib 19)

SEQ ID NO: | lib 18 clones | lib 19 clones | lib 19/lib 18
3146 | 3 | 19 | 6
3150 | 21 | 228 | 10
3324 | 3 | 20 | 6

Table 18

Differentially expressed polypeptides detected in samples from patient. Higher
expression in normal colon tissue (lib 18) than colon tumor tissue (lib 19)

SEQ ID NO: | lib 18 clones | lib 19 clones | lib 18/lib 19
198 | 21 | 2 | 12
465 | 6 | 0 | 7
489 | 6 | 0 | 7
745 | 6 | 0 | 7
859 | 11 | 2 | 6
976 | 7 | 0 | 8
1011 | 209 | 37 | 6
1045 | 8 | 1 | 9
1138 | 14 | 0 | 16
1253 | 23 | 0 | 26
1392 | 16 | 4 | 5
1474 | 23 | 0 | 26
1589 | 6 | 0 | 7
1591 | 22 | 11 | 2
1607 | 386 | 158 | 3
1643 | 18 | 0 | 21
1753 | 12 | 0 | 14
1764 | 12 | 0 | 14

SEQ ID NO: | lib 18 clones | lib 19 clones | lib 19/lib 18
105 | 28 | 64 | 2
1011 | 11 | 53 | 4
1226 | 2 | 18 | 8
1251 | 6 | 19 | 3
1559 | 1 | 9 | 8
1571 | 0 | 8 | 7
1608 | 1 | 9 | 8
1766 | 2 | 13 | 6
1782 | 1 | 9 | 8
1811 | 1 | 17 | 15

Table 19

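Differentially expressed polynucleotides:
Higher expression in colon tumor tissue
(patient 2, lib 16) vs. normal colon tissue (patient 2, lib 15)

SEQ ID NO: | lib 15 clones | lib 16 clones | lib 16/lib 15
[illegible] | [illegible] | [illegible] | [illegible]
164 | [illegible] | [illegible] | [illegible]
[illegible] | 4 | 13 | [illegible]
[illegible] | [illegible] | [illegible] | [illegible]
[illegible] | 2 | 11 | [illegible]
965 | [illegible] | 11 | [illegible]
987 | 2 | 11 | [illegible]
1026 | 7 | 19 | [illegible]
1044 | 4 | 16 | 4
1119 | 4 | [illegible] | 4
1226 | [illegible] | [illegible] | [illegible]
1227 | 0 | 9 | 9
1251 | 7 | [illegible] | [illegible]
1316 | 0 | 6 | [illegible]
1429 | 27 | [illegible] | [illegible]
1442 | 0 | 9 | 9
1540 | [illegible] | [illegible] | [illegible]
[illegible] | [illegible] | [illegible] | [illegible]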


1560 


4 


24 


6 


1577 


1 


10 


9 


1588 


5 


20 


4 


1610 


3 


13 


4 


1620 


2 


23 


11 


1626 


6 


23 


4 


1673 


2 


15 


7 


2416 


0 


7 


7 


2749 


23 


54 


2 


2976 


3 


14 


4 


3129 


26 


64 


2 


3132 


18 


54 


3 
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EXAMPLE 1 1 

Polynucleotides Differentially Expressed in Growth Factor-Stimulated
Human Microvascular Endothelial Cells (HMEC) Relative to Untreated
HMEC

A number of polynucleotide sequences have been identified that are
differentially expressed between human microvascular endothelial cells (HMEC) that
have been treated with growth factors and untreated HMEC.

Sequences that are differentially expressed between growth factor-treated
HMEC and untreated HMEC can represent sequences encoding gene products involved
in angiogenesis, metastasis (cell migration), and other developmental and oncogenic
processes. For example, sequences that are more highly expressed in HMEC treated
with growth factors (such as bFGF or VEGF) relative to untreated HMEC can serve as
markers of cancer cells of higher metastatic potential. Detection of expression of these
sequences in colon cancer tissue can provide diagnostic, prognostic and/or treatment
information associated with the prevention of achieving the malignant state in these
tissues, and can be important in risk assessment for a patient. A patient sample
displaying an increased level of one or more of these polynucleotides may thus warrant
closer attention or more frequent screening procedures to catch the malignant state as
early as possible.

The following table summarizes identified polynucleotides with
differential expression between growth factor-treated and untreated HMEC.

Table 20

Differentially expressed polynucleotides:
Higher expression in untreated HMEC (lib 12) vs. bFGF treated HMEC (lib 13)

SEQ ID NO:    lib 12 clones    lib 13 clones    lib 12/lib 13
849           6                0                6
1059          6                0                6
1206          12               2                6
3208          12               0                12

Lower expression in untreated HMEC (lib 12) vs. bFGF treated HMEC (lib 13)

2748          3                12               4
3325          0                6                6


Table 21

Differentially expressed polynucleotides:
Higher expression in untreated HMEC (lib 12) vs. VEGF treated HMEC (lib 14)

SEQ ID NO:    lib 12 clones    lib 14 clones    lib 12/lib 14
1150          9                0                9

Lower expression in untreated HMEC (lib 12) vs. VEGF treated HMEC (lib 14)

3324          22               50               2



EXAMPLE 12

Polynucleotides Differentially Expressed in Normal Prostate Cells
Relative to Prostate Cancer Cells

A number of polynucleotide sequences have been identified that are
differentially expressed between normal prostate cells and prostate
cancer cells. Expression of these sequences in prostate tissue suspected of being
cancerous can provide diagnostic, prognostic and/or treatment information. These
polynucleotide sequences can also be used in combination with other known molecular
and/or biochemical markers. The following table summarizes identified
polynucleotides with differential expression between a normal prostate cell line
and a prostate cancer cell line:

Table 22

Differentially expressed polynucleotides: normal prostate cell line (lib 21)
vs. prostate cancer cell line (lib 22)

Higher in lib 21

SEQ ID NO:    lib 21 clones    lib 22 clones    lib 21/lib 22
53            17               2                8
1754          22               8                3
1801          7                0                7
1845          22               6                4
446           8                0                8
1410          6                0                6
2060          18               6                3
2143          12               3                4
2632          13               1                13
2899          16               2                8
3338          12               2                6

Higher in lib 22

86            2                13               7
93            0                9                9
687           0                9                9
1269          1                15               15
1581          25               74               3
1647          25               74               3
1649          12               27               2
1710          5                16               3
1717          5                16               3
1772          12               27               2
1960          0                6                6
2987          0                6                6
3128          13               42               3
3132          13               42               3
3150          263              962              4
3222          0                6                6
3268          0                6                6

EXAMPLE 13

Polynucleotides Differentially Expressed Across Multiple Libraries 

A number of polynucleotide sequences have been identified that are
differentially expressed between cancerous cells and normal cells across two or more
tissue types tested (i.e., breast, colon, lung, and prostate). Expression of these
sequences in a tissue of any origin can provide diagnostic, prognostic and/or treatment
information associated with the prevention of achieving the malignant state in these
tissues, and can be important in risk assessment for a patient. These polynucleotides
can also serve as non-tissue specific markers of, for example, risk of metastasis of a
tumor. The following polynucleotides were differentially expressed but without tissue
type-specificity in at least two of the breast, colon, lung, and prostate libraries tested:
53, 105, 355, 412, 614, 836, 1442, 1581, 1647, 1649, 1664, 1772, 1782, 1811, 1845,
1856, 1875, 1923, 2060, 2071, 2135, 2146, 2239, 2313, 2378, 2393, 2416, 2460, 2490,
2632, 2674, 2704, 2724, 2749, 2784, 2804, 2959, 2976, 2977, 2980, 2987, 3009, 3047,
3128, 3129, 3132, 3146, 3150, 3156, 3210, 3324, 3331, and 3335.
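The selection described above (sequences differentially expressed in at least two tissue types) can be illustrated with a short sketch. The function name `shared_across_tissues` and the dictionary layout are assumptions made for illustration; they are not part of the specification. The SEQ ID NOs shown are only a subset taken from the colon tables (Tables 17 and 19) and the prostate table (Table 22), so the result is a partial illustration and not a reconstruction of the full list.

```python
from collections import Counter


def shared_across_tissues(hits_by_tissue, min_tissues=2):
    """Return SEQ ID NOs reported as differentially expressed in at least
    `min_tissues` tissue types (hypothetical helper, for illustration only)."""
    counts = Counter(
        seq_id
        for ids in hits_by_tissue.values()
        for seq_id in set(ids)  # count each SEQ ID once per tissue
    )
    return sorted(seq_id for seq_id, n in counts.items() if n >= min_tissues)


# Illustrative subset only: colon IDs from Tables 17 and 19,
# prostate IDs from Table 22. Breast and lung data are omitted here.
hits = {
    "colon": {2416, 2749, 2976, 3129, 3132, 3146, 3150, 3324},
    "prostate": {53, 1581, 1647, 1649, 1772, 1845, 2060, 2632,
                 2987, 3128, 3132, 3150},
}

shared = shared_across_tissues(hits)
print(shared)  # [3132, 3150]
```

With the complete per-tissue lists, the same counting step would be expected to yield the full set of non-tissue-specific markers; raising `min_tissues` would restrict the result to sequences seen in more tissue types.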

Those skilled in the art will recognize, or be able to ascertain, using not
more than routine experimentation, many equivalents to the specific embodiments of
the invention described herein. Such specific embodiments and equivalents are
intended to be encompassed by the following claims.

All publications and patent applications cited in this specification are
herein incorporated by reference as if each individual publication or patent application
were specifically and individually indicated to be incorporated by reference. The
citation of any publication is for its disclosure prior to the filing date and should not be
construed as an admission that the present invention is not entitled to antedate such
publication by virtue of prior invention.

Although the foregoing invention has been described in some detail by
way of illustration and example for purposes of clarity of understanding, it is readily
apparent to those of ordinary skill in the art in light of the teachings of this invention
that certain changes and modifications may be made thereto without departing from the
spirit or scope of the appended claims.

Deposit Information:

The following materials were deposited with the American Type Culture
Collection (ATCC); CMCC = Chiron Master Culture Collection:

cDNA Libraries Deposited with ATCC

Tube Number    Deposit Date    ATCC Accession No.    CMCC Accession No.
ES137          May 30, 2000
ES138          May 30, 2000
ES139          May 30, 2000
ES140          May 30, 2000
ES141          May 30, 2000
ES142          May 30, 2000
ES143          May 30, 2000
ES144          May 30, 2000
ES145          May 30, 2000
ES146          May 30, 2000
ES147          May 30, 2000
ES148          May 30, 2000
ES149          May 30, 2000
ES150          May 30, 2000
ES151          May 30, 2000
ES152          May 30, 2000
ES153          May 30, 2000
ES154          May 30, 2000
ES155          May 30, 2000
ES156          May 30, 2000
ES157          May 30, 2000
ES158          May 30, 2000
ES159          May 30, 2000
ES160          May 30, 2000
ES161          May 30, 2000
ES162          May 30, 2000
ES163          May 30, 2000
ES164          May 30, 2000
ES165          May 30, 2000
ES166          May 30, 2000
ES167          May 30, 2000



Table 23 lists the clones for each deposit, designated by "tube" number.
This deposit is provided merely as a convenience to those of skill in the art, and is not an
admission that a deposit is required under 35 U.S.C. §112. The sequence of the
polynucleotides contained within the deposited material, as well as the amino acid
sequence of the polypeptides encoded thereby, are incorporated herein by reference and
are controlling in the event of any conflict with the written description of sequences
herein. A license may be required to make, use, or sell the deposited material, and no
such license is granted hereby.

Retrieval of Individual Clones from Deposit of Pooled Clones 

Where the ATCC deposit is composed of a pool of cDNA clones, the
deposit was prepared by first transfecting each of the clones into separate bacterial cells.
The clones were then deposited as a pool of equal mixtures in the composite deposit.
Particular clones can be obtained from the composite deposit using methods well
known in the art. For example, a bacterial cell containing a particular clone can be
identified by isolating single colonies, and identifying colonies containing the specific
clone through standard colony hybridization techniques, using an oligonucleotide probe
or probes designed to specifically hybridize to a sequence of the clone insert (e.g., a
probe based upon unmasked sequence of the encoded polynucleotide having the
indicated SEQ ID NO). The probe should be designed to have a Tm of approximately
80°C (assuming 2°C for each A or T and 4°C for each G or C). Positive colonies can
then be picked, grown in culture, and the recombinant clone isolated. Alternatively,
probes designed in this manner can be used in PCR to isolate a nucleic acid molecule
from the pooled clones according to methods well known in the art, e.g., by purifying
the cDNA from the deposited culture pool, and using the probes in PCR reactions to
produce an amplified product having the corresponding desired polynucleotide
sequence.
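
The Tm estimate stated above (2°C for each A or T, 4°C for each G or C) can be sketched in a few lines. This is a minimal illustration under stated assumptions: the function names `wallace_tm` and `pick_probe`, the ±2°C tolerance, and the 18 to 40 nucleotide length window are hypothetical choices, not values given in the specification, and the example insert is an artificial repeat, not one of the disclosed sequences.

```python
def wallace_tm(probe):
    """Estimate Tm in deg C using the rule stated in the text:
    2 deg C for each A or T and 4 deg C for each G or C."""
    seq = probe.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("probe must contain only A, C, G, or T")
    return 2 * at + 4 * gc


def pick_probe(insert, target_tm=80, tolerance=2, min_len=18, max_len=40):
    """Return the first window of the insert whose estimated Tm is within
    `tolerance` of `target_tm`, or None if no window qualifies.
    The tolerance and length limits are illustrative assumptions."""
    seq = insert.upper()
    for start in range(len(seq)):
        for length in range(min_len, max_len + 1):
            window = seq[start:start + length]
            if len(window) < length:
                break  # ran past the end of the insert
            if set(window) <= set("ACGT") and \
                    abs(wallace_tm(window) - target_tm) <= tolerance:
                return window
    return None


# A 20-mer of only G and C gives 20 * 4 = 80 deg C under this rule.
print(wallace_tm("GC" * 10))

# Artificial 40-nt insert used purely to exercise the search.
probe = pick_probe("ACGT" * 10)
print(probe, wallace_tm(probe))
```

Windows containing ambiguous bases (for example `n`) are skipped, which loosely mirrors the text's suggestion to base the probe on unmasked sequence. The 2/4 rule is only a rough estimate for short oligonucleotides, so any probe chosen this way would still need to be checked empirically.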

Table 23 







Clone Name                  Tube
M00001351A:B02              ES 137
M00001356A:H11              ES 137
M00001363D:D09              ES 137
M00001395D:H02              ES 137
M00001439C:H06              ES 137
M00001476B:G10              ES 137
M00001582A:E02              ES 137
M00003750D:E06              ES 137
M00003761C:F02              ES 137
M00003770A:E05              ES 137
M00003786A:A11              ES 137
M00003800A:F09              ES 137
M000038[illegible]6D:E11    ES 137
M00003902A:C03              ES 137
M00003991C:F06              ES 137
M00003995B:E03              ES 137
M00004046C:A08              ES 137
M00004105D:B05              ES 137
M00004139B:B10              ES 137
M00004140D:C03              ES 137
M00004144A:H05              ES 137
M00004152A:C12              ES 137
M00004155D:A10              ES 137
M00004168A:G11              ES 137
M00004197B:H10              ES 137
M00004222C:E03              ES 137
M00004234A:E07              ES 137
M00004239B:F11              ES 137
M00004241B:H07              ES 137
M00004264B:A05              ES 137
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■-<r~-i Clone iNarne. . 




M0Q0042 /oA:rUy 


CC 117 
DO I J I 


M 000042 82 D:L 1 1 


CC 117 


M00004308C:C06 


cc m 
bo 13 / 


M00004340C:L07 


bo 13/ 


MAAAAdlSdn-FAS 

JVIUV/V/Wt J JH.L/. Cv J 


' FS H7 

C*0 1 J / 


M00004361A:H02 


ES 137 


M00004372B:F07 


ES 137 


M00004378A:B10 


ES 137 


M0O0O4393B:EO7 


;ES 137 


M00023282A:C02 


ES 137 


M00023300D.C11 


ES 137 


M00023316C.G08 


ES137 



M00023333D:C12 iES 137 



M00023352B:F03 I ES 137 



!M00023352D:H03 j ES 137 



M00023376B:G04 j ES 13 7 
jM000233WB:D12~ 



ES 137 



i ES 137 



! M00023399C:EJ[0 
M06626803A:F08 



ES 
ES 



137 
137 



— ( 



M00026843B:D10 ; ES 137 



M00026850D;F09 


j ES 137 


M00026851B:F01 


;ES 137 


M00026856D:F02 


jES 137 


M00026857D:G12 


;ES 137 


M00026859D:D01 


! ES 137 


M00026860B:C05 


ES 137 


M00026865B:A06 


j ES 137 




1 ES 137 


M00026878A:F05 


lES 137 


I M00026882D.G09 


;ES 137 


"M66626885A:H69 


! ES 137 


!Md0026901A:G07 


ES 137 \ 



M000269 1 4A:H1 0 jES 

iES 
iES 



M00026915B:C06 
j M Q002691 8B:b6T 
iM00026922C:B02 



137 
137 



137 



M00026922C:G03 



ES 
ES 



137 
137 



M00026926A:E10 i ES 137 



M00026927D:F02 iES 137 



M0002692 8D:A0 3 j ES 137 
M00026935C:B04" ' 



M0002694 1 D: A04 



ES 
ES 



137 



137 



M00026944B:E03 IES 137 
M00026946A:F12 !ES 137 









CO 1 J f i 


! \A(\(\(Y)1C\ 1 A A -RAY\ 


LO I J f 


; N/innftOTftl 8 A *PflO 
; IVIUUUZ f\J I of\.\->\ty 


cc i 17 1 


■ IVIUWx. I\J£. I /VvJVZ. 

i 


cc H7 ~1 


i MfifiA97n97n-n 1 1 


PC 117 

Co 1 J / ; 


; IV1UUVZ /V3UV^.rll/0 


Trc 117 ; 

CO l J / 


! K/IAAA77A1 ^TV/^AA 


CC 117 ; 
Co 1 J / , 


' h /lAAAOlA/lfiD .PAC 

M0U02 /U4Vt>. rUj 


CC 117 ; 
CO J J / 1 

1 


: lV/fAAA77A7W A -D.A7 


CC 1 17 "1 
Co 13/ 


IXAAAnAOA A .DAI 

M000270oOA :BO 1 


CC 1 1 7 * 

bo U / 


■ \A AAA77Aft<f~ l - C 1 1 


CC 117 

CO I J / ; 


1 1 ... - 

MAAAHAA/l A .DAI 

, M0002 /094A:B03 


CC 1 1 "7 

bo 1 i / 


■ \vlAAAT7 1 Al D • A AO 

. MUOOz / 1 UJrJ. AW 

i, . . „ 1 


CC 117 

bo 1 j / ; 

— j 


: M00027108C:B03 


bS 137 


M00027121D:C05 


ES"f37"": 


[M00027135A:B11 


ESI37 ; 


:M00027136C:C09 


ES137 


!M0002714IC:H03 


ES137 ] 


M00027159D:F03 


ES 137 : 


M0Q0271 62B:F05 


ES 137 ^ 


!M00027178B:G09 


ES 137 


j M00027 179D?E66 ]ES I38 


!M00027181D:A05 


ES138 ; 


|M00027195C:E04 


ES 138 


fM00027198B:B08 


ES 138 



M00027200A:F02 



ES 138 



f- 



M00027207B:F07 1 ES 138 



!M00027212D:E03 



iM00027228D:A01 
iM^027232D:Bq8 
!M00027233B*C0l" 



M00027236A:E04 



jM00027237C:B08 
[M06027248A:C02 
jM00027256B:H09 
SM00027258A:A07 



ESJ38 
"'"["ES 138 
j ES 138 
[ES 138 
IEST38" 
TES 138" 



ES 138 



|ES 138 



ES 138 



!M00027263A:F10 TeS 138 



M00027292D:F10 



•-I 



u. 



i M00027297A:C04 



:M00027299B:B12 
lMb0027301ATG05" 



IES 138 

L ESl38 
ES 138 



j M0002730TB^B08 
n^0027314C:D09 



ESJ38 
ES 138 



!M00027319D:B1J 
\ M000^3^D:C()5 



ES 138 



ES 138 
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|M00027347C:G07 



!M00027355A:B07 



!M00027359B:G05 



ES 138 



ES 138 



ES 138 



|M00027366A:F11 1 ES 138 



I M00027379C:B07 i ES 138 



I M00027392B:H02 iES 138 



! M00027396D:G08 IES 138 



IM00027398C:F07 1ES 138 



M00027438C:G07 ES 138 



M000 27462A :D07 jES 1 38 
M^27462B:H07 j ES 1 38 
~M00027468A:C09 j ES 138 



M00027475B:E10 ES 138 



M00027476A:C09 IES 138 



M00027486A:F06 !ES 138 



M00027520A:C05 ! ES 138 



M00027525B:D06 
:M00027526D:F03 



;ES 138 



ES 138 



M00027528C:B10 (JES J138 
M 0002753 7C:B01 1 ES 1 38 
M06027546C:B 10 " ! ESl 38 



M00027591j*£04 j ESJ38 
M00027596A:A1 0 iES 1 38 



M00027596C:E06 j ES 138 
M00027602B:C0 \ " TeS 13 8 
M000276i5A:F10 j ES 1 38 



M00027617B:C12 :ES 138 



M00027620D:F11 'ES 138 



M00027625A:H01 !ES 138 
M60027634A:DTf Tes738 
M 0002764 1C: A03 ES 138 
"M00027647C:D03 ] ES K38 
M00027652B:F11 i ES 138 

j ES 138 
IES 138' 



M00027668C:H12 
M00027729D:H06 



M00027733A:A02 iES 138 



M00027741B:F09 
MO' 0627743A:C 03" 
MOOO^'801C:Cn 
M066278T3C:F0f 



l ES iH. 
[ES 138 

j ES 138 

TES 138 



IM00027818C:C07 MES j38 
!Mib0027836D:F12 1 ES 1 38 



M00027837C:D09 ; E S 138 
^^^±20D:^}2 tESj38 
M00028066C:D07 " ! ES 138 







Mfi00281 84DG10 


ES 138 


M000281 8SRA06 

1VI V/v/\/Ji.O 1 OJU.rVvw 


ES 138 


M00028 1 96D* A03 


ES 138 


M00028201BH12 


ES 138 


M00028207D:E09 


ES 138 


M0002X210R-D02 


ES 138 i 


M00028212C:B08 


ES138 | 


|M00028215D:F03 


ES 138 ~~] 


|M00028220A:B04 


ES138 ! 


jM00028314D;F05 


ES138 | 


M00028316B:H12 


ES138 ; 


M00028354A:B12 
M00028354D:A03 


ES138 
fES 138 


M00028357A:G10 


ES138 



j M0002 83 64C:G08 
rM00028369D:E08 
!M00028617C:A12 



M00028768C:D05 



lM0003]368A:E10 
j M00031 41 7C:G09 
iM6O0314f9b:C04 



;M00031485D:G02 
M00632480B:EI0 



M00032492A:C01 
M000^495B:D6'2 
M00032499C:A01 



M00032508B:H03 



M00032510D:F12 



M00032510D.G06 
M6o6 325T3 D:Fbr' 
M0003253 (jD:C0 2 
M0O032535D:H0T 

M00032539B:C 1 1 " 
M00032540A:A09 
M0003254 i D:H08 



M000325 50D:C02 
M0003255 1_B:G05 
M06632577A:C04 



! M00028770A:D04 



M00028 772C:B09 j ES 
M00028775p:F03 ~"Te 
M66b28777B^Gl 2 "~Tf 



IES 



139 



ES 



| M00032545B:H09 
I M00032545D:G05 ~ | ES 



— t 
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*ubep| 


M00032578A:G06 


!ES 


139 


M00032584A:H08 


jES 


139 


M00032592A:H11 


;ES 

i 


139 


M00032597C:B01 


Ies 


139 


M00032638C:G08 


iES 


139 


M00032638D:A06 


j ES 


139 


M00032668D:G12 


;ES 


139 


|M06032678C:D06 


jES 


139 


tM00032688D:Dll 


IES 


139 


l"S^^327i"2BiG02 


1 ES 

> 


139 



ES 139 
ES 139 



I M00032724A:C05 
t" M00632 725C:F06" 
[m00032726C:C01 

| M00 Q3273 1 B:C10 ; 

1^003273 1 C:C07 ES 139 
!M0003^37B:E09 " ES 139 



;ES 139 
iES 139 



ES 139 
ES 139 



jjS400032739AjA06 

n^^ 27 44B:F10 ' 

|mQ0032766B:D12 TES_139 
SM06032766C:A64 ES 139 
M06'032790B:A67 jESJ 39 
M066"3^793A ~ ; ES 139 



M00032797B.G02 
"M00032808B:G 10 



M0003281 1B:P02 
^M00032829B:E06 



M00032830D:G03 
Mbod32831C:G07 



ES 139 
ES 139 
ESJ39 
ES 1 39 
ES 139 
ESl39 



M00032853D^G1^ JES 139 
1M00~032864B:B~Q9 " ,EST39 
j 1^003287 1 D:E1 \" ES 1 39 

M0b032876C^D06 ES 139 



M00032907A:G04 



M00032909A:B06 



ES 139 
ES 139 



M000329nD:G09 
MOOiM 



v. 



jM00032918B:E06 
M00032'9l8C:B10 



! M00032921B:H 08 jES 139 

fM(^d32^ 3A 0 jES 139 

|M00032939B:E07 j ESj 39_ 
\M06q^A0A:C62~ iES 139 



!ES 139 
1 ES 139 
[ES 139 
! ESl39 



|M00032942D:C12 
!M00032944B:B02 



; ESJ 39 
IES 1 39" 



! M00032984C:G05 iES 139 



Clone Name^ 



i M00032990B.A11 

(. ■ — 


ES 


139 j 


|M00032994A:A08 


ES 


139 | 


!M00032995C:C05 

i — 


ES 


139 ! 

— i 


!M00033007C:E01 


ES 


139 | 


!M00033019B:E10 


ES 


— — i 

139 ! 

< 


SM00033033C:H01 


ES 


139 i 


|M00033034C:A06 


ES 


139 j 


|M00033034C:F02 


ES 


139 j 


<M00033037D:C11 


ES 


139 1 


iM00033074A:C08 


ES 


-I 

139 i 

T 


!M00033130B:F06 


ES 


139 ! 


|M00033140D:F06 ' 


ES 


1 39 " I 


|M00033173D:C01 


ES 


139 ! 


}M00033176B:E12 


ES 


139 j 


|M00033186C:DU 


ES 


139 ! 


lM0b033T89D:F08~ 


ES 


139 j 


| M00033202D:G06 


ES 


139 j 


|M00033204B:A07 


ES 


"139" 1 



.-sr.- 



|M00033205A:F03 



; M00033217B :H07 
pVI000332 1 8ATc04 
Fm00033223B:H07 



iM00033226A:A11 



I 



ES 139 
E£f39 
ES 139 
jE S 139 
[ES 139 



M0003 3231D:BQ9 i ES 139 
1^6033231ip:G10 
M0b033243B:A05 



IES 139 
ES 139 



— 



j M00033246C:E08 ES 139 

| M000332 48A:B02 j"ES 139 

n^00033 26lC :D12 [ES 139 

; M 00033262 D: A 1 f~ jES 139 

f M0003 3263 B.G04 jES 139 



|M00033276B:G08 


ES 


139 


!M00033185C:D01 


ES 


139 : 


rM00033288B:D12 


ES 


140 1 


|M00033300D:H12 


ES 


140 ■! 


!M0bb33^6D:G08 


ES 




^M00033306D:H09 


ES 


140 


j MObb33308B-G05 


Is 


140 ; 


!M00033343C:Hb8 


ES 


140 | 


!M00033345D:A09 


ES 

, — — 


146 1 

i 


!M00033346C:A05 


ES 


I.I T.^ 

140 j 


|M00033347C:F02 


ES 


140 j 


\ M00033349D:F05 


ES 


140 j 


nMO0O33358A:Hi2 


ES 


140 1 


!M00033362C:C05 


ES 


140 ! 
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Clone Name ^ 



-Tube 



M00033375A:G04 i ES 140 



M00033376A:C12 I ES 140 



M0O033377D:AO5 ]ES 140 



M000334tOB:C09 [ES 140 



M00033424B:A04 |ES 140 



M00033424D:H12 ]ES 140 



M00033425A:C10 IES 140 



M0 003342 7D:F01 IES 140 
M00033432B:H10 " j ES 1 40 
| M06033437C:A07 llsl40 
M06033437C:C03 TeS 140 



M00033442A:D06 IES 140 



M00033446C:G08 ;ES 140 
M00033446D:B02~ IIS 1 40 



iM0003 3450C:A0 2 

iM00033451A:H6T 

jMb0033454A:D09 

|M 00033457D :A05 

IM00033560D™G07 

IM0003316TC7A02 



ES 140 
ES 140 
ES f40 
ES 140" 



ES 140 



ES 140 



|M00033566C:E08 
!M06W357^:C08T 
[^00033 570Bl 06~ 
rM06033570C:Cl0 
M00033578D:G02 



ES 140 
ES140 
ES 140 
ES 140 



i 



M00033581C:H10 



ES 140 
ES 140 



\ M000 33S81D:D0 8 
I'M 00033583 B^E0<T 
jM00^583b:B05 
1m60033584D:G1T 
[M00033585b-A02 



ES 140 
ES140 
ESJ40 
ES 1 40 
ES 140 




M 00033588C: G04 
!M00033594C:B03 



IM00033595A:C11 
! M00038259A:G08 
;M60038B9B:A02 



M00038259B:G08 



M00038259C:H09 



ES 140 
ES 140 
ISJ40 
ES 140 
ES 140 
ES 140 
ES 140 



t 

— 



M00038272A:G01 



M00038272D:F11 



M00038279C:A11 
I M00038284B:H04 
j M00038^3A:CO3 
iM66638WC:D02 



ES 140 
ES 140" 
ESJ40 
ES 140 
ES 140 



ES 140 



M00038303D:E07 IES 140 



CloneNarae 



M00038315C:G11 



M00038325D:F12 



iM00038326B:D04 



M00038327A:C11 



M00038327D:A05 



i- 



M00038328D:A03 



!M00038329A:E08 



|M00038387B:A07 



jM00038614C:H11 
i M000386 1 5 A:~H 12 



M00038616D:B12 



M00038618C:C08 



M00038619B:A03 



1M00038620B :EQ9 
M06038631C:B10 
M0663863 1 D:B02 



ES 140 



ES 140 
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CLAIMS 



We claim: 



1. A library of polynucleotides, the library comprising the sequence 
information of at least one of SEQ ID NO: 1-3351. 

2. The library of claim 1, wherein the library is provided on a nucleic
acid array.

3. The library of claim 1, wherein the library is provided in a
computer-readable format.

4. The library of claim 1, wherein the library comprises a
polynucleotide corresponding to a gene differentially expressed in a cancer cell of high
metastatic potential relative to a control cell, wherein the control cell is a normal cell or a
cell of low metastatic potential, wherein the expression is greater in the metastatic tissue,
and wherein the sequence is selected from the group consisting of SEQ ID NOs:14, 137,
151, 152, 171, 200, 254, 262, 271, 348, 412, 472, 507, 520, 530, 588, 623, 637, 660, 678,
680, 700, 714, 774, 812, 834, 901, 937, 976, 1168, 1333, 1352, 1520, 1524, 1546, 1550,
1574, 1580, 1590, 1599, 1607, 1622, 1706, 1752, 1768, 1769, 1780, 1781, 1799, 1803,
1811, 1851, 1856, 1867, 1872, 1875, 1884, 1919, 1923, 1939, 1975, 2024, 2045, 2060,
2071, 2118, 2119, 2128, 2135, 2177, 2181, 2184, 2185, 2190, 2193, 2232, 2239, 2283,
2311, 2314, 2338, 2378, 2393, 2394, 2395, 2398, 2460, 2490, 2505, 2514, 2540, 2542,
2597, 2607, 2640, 2657, 2669, 2670, 2674, 2679, 2684, 2707, 2724, 2757, 2776, 2804,
2818, 2906, 2959, 2964, 2968, 2976, 2980, 2987, 3010, 3043, 3047, 3050, 3071, 3072,
3092, 3095, 3097, 3140, 3157, 3173, 3187, 3203, 3210, 3212, 3220, 3236, 3249, 3264,
3284, 3288, 3305, 3309, 3318, 3330, 3331, and 3335.

5. The library of claim 1, wherein the library comprises a
polynucleotide corresponding to a gene differentially expressed in normal colon tissue
relative to colon cancer tissue, wherein the expression is greater in the cancer tissue, and
wherein the sequence is selected from the group consisting of SEQ ID NOs:7, 164, 734,
836, 928, 965, 987, 1026, 1044, 1119, 1226, 1227, 1251, 1316, 1429, 1442, 1540, 1553,
1560, 1577, 1588, 1610, 1620, 1626, 1673, 2416, 2749, 2976, 3129 and 3132.

6. The library of claim 1, wherein the library comprises a
polynucleotide corresponding to a gene differentially expressed in normal colon tissue
relative to colon cancer tissue, wherein the expression is greater in normal tissue than
cancer tissue, and wherein the sequence is selected from the group consisting of SEQ ID
NOs:105, 198, 465, 489, 745, 859, 976, 1011, 1045, 1138, 1226, 1251, 1253, 1392, 1474,
1559, 1571, 1589, 1591, 1607, 1608, 1643, 1753, 1764, 1766, 1782, 1811, 2749, 2784,
2790, 2805, 2976, 3128, 3129, 3146, 3150, and 3151.

7. The library of claim 1, wherein the library comprises a
polynucleotide corresponding to a gene differentially expressed in normal human
prostate cells relative to human prostate cancer cells, wherein the expression is greater
in normal cells than cancer cells, and wherein the sequence is selected from the group
consisting of SEQ ID NOs:53, 446, 1410, 1754, 1801, 1845, 2060, 2143, 2632, 2899,
and 3338.

8. The library of claim 1, wherein the library comprises a
polynucleotide corresponding to a gene differentially expressed in normal human
prostate cells relative to human prostate cancer cells, wherein the expression is greater
in cancer cells than normal cells, and wherein the sequence is selected from the group
consisting of SEQ ID NOs:86, 93, 687, 1269, 1581, 1647, 1649, 1710, 1717, 1772,
1960, 2987, 3128, 3132, 3150, 3222, and 3268.

9. An isolated polynucleotide comprising a nucleotide sequence
having at least 90% sequence identity to an identifying sequence of SEQ ID NOs: 1-3351 or
a degenerate variant or fragment thereof.

10. A recombinant host cell containing the polynucleotide of claim 9.

11. An isolated polypeptide encoded by the polynucleotide of claim 9.

12. An antibody that specifically binds a polypeptide of claim 11.

13. A vector comprising the polynucleotide of claim 9. 

14. A method of detecting differentially expressed genes correlated
with a cancerous state of a mammalian cell, the method comprising the step of:

detecting at least one differentially expressed gene product in a test sample
derived from a cell suspected of being cancerous, wherein the gene product is encoded by a
gene corresponding to a sequence of at least one of SEQ ID NOs: 14, 137, 151, 152, 171,
200, 254, 262, 271, 348, 412, 472, 507, 520, 530, 588, 623, 637, 660, 678, 680, 700, 714,
774, 812, 834, 901, 937, 976, 1168, 1333, 1352, 1520, 1524, 1546, 1550, 1574, 1580,
1590, 1599, 1607, 1622, 1706, 1752, 1768, 1769, 1780, 1781, 1799, 1803, 1811, 1851,
1856, 1867, 1872, 1875, 1884, 1919, 1923, 1939, 1975, 2024, 2045, 2060, 2071, 2118,
2119, 2128, 2135, 2177, 2181, 2184, 2185, 2190, 2193, 2232, 2239, 2283, 2311, 2314,
2338, 2378, 2393, 2394, 2395, 2398, 2460, 2490, 2505, 2514, 2540, 2542, 2597, 2607,
2640, 2657, 2669, 2670, 2674, 2679, 2684, 2707, 2724, 2757, 2776, 2804, 2818, 2906,
2959, 2964, 2968, 2976, 2980, 2987, 3010, 3043, 3047, 3050, 3071, 3072, 3092, 3095,
3097, 3140, 3157, 3173, 3187, 3203, 3210, 3212, 3220, 3236, 3249, 3264, 3284, 3288,
3305, 3309, 3318, 3330, 3331, and 3335,

wherein detection of the differentially expressed gene product is correlated with
a cancerous state of the cell from which the test sample was derived.

15. A method of detecting differentially expressed genes correlated
with a cancerous state of a mammalian cell, the method comprising the step of:

detecting at least one differentially expressed gene product in a test
sample derived from a cell suspected of being cancerous, wherein the gene product is
encoded by a gene corresponding to a sequence of at least one of SEQ ID NOs:7, 164,
734, 836, 928, 965, 987, 1026, 1044, 1119, 1226, 1227, 1251, 1316, 1429, 1442, 1540,
1553, 1560, 1577, 1588, 1610, 1620, 1626, 1673, 1960, 2416, 2749, 2976, 2987, 3128,
3129, 3132, 3150, 3222, and 3268,

wherein detection of the differentially expressed gene product is correlated with
a cancerous state of the cell from which the test sample was derived.
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aagagagatc ctaaagaaag caaaatcact 390 
<210> 1144 <211> 458 <212> DNA <213> Homo sapien 

ggcacgagga gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga 60 

gagagagaga gagagagaga gagagagaga gagagagagc gagagagaga gagagagcgc 120 

gagagagaga gatatatata tatctctcgc gctcgcgcgc gctctctctc tctttttttc 180 

tcttttgcgc gatttctctc gcgccccccc ttctctctct ctctctctct ccctctctct 240 

ctctctctct gtctctcact ctctctcttt cttttttttt ttatacactc tctctctctc 300 

tccctctctc tccctctccc tctttgtttc tcccgcgaga tctgtgtctc ttctttttgg 360 

gaagacaccc tctctctccg ccccctcttt gcgccttttt gagacacccc cccccctctc 420 

ctccctcttc tttttctcgg gggcttctcc cgtctttt 458 
<210> 1145 <211> 391 <212> DNA <213> Homo sapien

tacggctgcg agaagacgac agaagggaca ataccgcatt ataaagattg tgagaggtag 60 

ctgtttgtta atgtccaaat ctcaaccaaa gagtacaata catacaaaat attacagtga 120 

catggcctaa gtaaaaaaaa aaaacttaaa actgtcggaa aacacccatg aaaataaaga 180 

ggtccacatt aattttaaaa acctaaccta aatgggaaca caggtaccta tttaaattcg 240 

gaaaaaaaca gaacatcagg taaaggatga aaaatatatt agaatttatg gaggtggaaa 300 

atggaaatag aaataatccc tgtggccagg cgeagcggcc catgtctgta gtcccagcac 360 

cttgggagtt gaggcggcag acacttgaac c 391 
<210> 1146 <211> 391 <212> DNA <213> Homo sapien 

tacggctgcg agaagacgac agaaggggga tagcacttta tgacaaaata ggactgattt 60 

ttaaatttag cagaattatc tatggattgt ctagatctca gctgatatta aaatataact 120 

atattcaagt atttcattag attaaagagc agaggataag gctgaattta aaaattctat 180 

atctcggccc ggtgcagtgg ctcactcctg taatcccagc actttgggag gccgaggcag 240 

gcggatcact tgaggtcagg agttcgagac cagcctggcc aacgtggtga aacaccatct 300 

gtactaaaaa aacttcactg ggcgtcttgg cgcacgcctg taatcccagc tactggggag 360 

gctgaggcat gagaatcact tgaacctgag a 391 
<210> 1147 <211> 456 <212> DNA <213> Homo sapien

tcttttggcc gaagcggcct acggctgcga gaagacgaca gaaggggtct gttggattca 60 

aaattttgta agccattttc acaagtacaa agatacattt taaccttgtc ttctccaaaa 120 

ttactgagta ggaattttat ttttatcttt ttgagacggg gtatcactgt cacccacact 180

ggagtgcggt ggtgggatct tggcttacr.g tgacctctgc ctcccgggtt caaatggtcc 240 

tccctcctea gtctcctgag tggctgggac ggcaggcgcg tgccaccatg cccagctaat 300 

ttgttctatt ttttctgtag agacggagtt ttgccatgtt gcccgggctg gtctcagact 360

cctgagctca ggcgatcatt tcgcctcggc ctcccagagt qctgagattg gaggtgtggg 420 

ccacac^ratc tggcccagag tgaggagaat catgag 456 
<210> 1148 <211> 385 <212> DNA <213> Homo sapien 

tacggctgcg agaagacgac agaagggcat tcattatcag gaagttttag ttatctgtca 60 

tttttttttt tcacatcagt ttgatcagga aagtgtataa cacatcttaa agcaagagtt 120 

agtttggtat taaatcctca ttagaacaac cacctgtttc actaataact tacccctgat 180 

gagtctatct aaacatatgc attttaagcc ttcaaattac attatcaaca tgagagaaat 240 

caccaccaaa gaagatgttc aaaataatag tcccatatct gtaatcatat ctacacgcaa 300 

tgttagtaat tctgaagttt tttaaattta tggctatttt tacacgatga tgaattttga 360 

cagtttgtgc attttcttta tacan 385 
<210> 1149 <211> 383 <212> DNA <213> Homo sapien 

tacggctgcg agaagacgac agaagggggg taagggatgt tcacttcaga agtcattggg 60 

gagtttccag cgatgtcaca tctgactacc ccggcatgga tatatagaat ctcttggatc 120 

ataccatcct ggctct.gttc cagatttagt ttgcctgttt gatctt.gagt acttttgttt 180 

tgttttgttt ttgagacgga gtcttgctcg gtcacccagg ctggagtgca gaggtgtgat 240 

ctcggctcac tacaacctcc accucccggg tttaagcgat tctcccgttc agtctcccgg 300 

gtagctgtga ttacaggcac ctgccatcat gctcaggtag tttttgtatt tttgtaacga 360 

cggggtttca ccacgttggc cag 383 
<210> 1150 <211> 381 <212> DNA <213> Homo sapien 

tacggctgcg agaagacgac agaaggggga agagaggcca ctttttctgc atttctctga 60 

gccctccctg gggcaagtgt cctctcacat ca.cacccatc tctacccagc agaaggctct 120 

cactcaagga ctgccaagga taacnaattc aagacccatc ccacccacta ggtgccaaaa 180

agctagcaag tcagctacct aataggcgtc ctttqagaca ttcaacacac atagatttaa 240 
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